Fungal aureobasidin sensitivity gene products

ABSTRACT

A gene associated with sensitivity to the antimycotic agent, aureobasidin A, has been isolated. The gene can be detected in a variety of cell types, and variant forms of the gene have been identified in mutant yeast strains. The proteins encoded by these genes are useful in diagnosing and treating mycoses.

This is a divisional application of Ser. No. 08/243,403, filed May 16,1994, now abandoned.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention relates to a protein regulating the sensitivity to anantimycotic aureobasidin and a gene coding for this protein, namely, agene coding for a protein regulating aureobasidin sensitivity. Thepresent invention further relates to a series of the uses of the proteinand the gene. Furthermore, it relates to an antibody against thisprotein and the use of the same.

2. Description of Related Art

Systemic mycoses including candidiasis have increased with an increasein immunocompromised patients in recent years due to, for example, theextended use of immunosuppressive drugs and acquired immunodeficiencysyndrome (AIDS), and as opportunistic infection due to microbialsubstitution caused by the frequent use of widespectrum antibacterialantibiotics. Although drugs for treating mycoses such as amphotericin B,flucytosine and azole drugs (for example, fluconazole and miconazole)are now used to cope with this situation, none of them can achieve asatisfactory effect. Also, known diagnostic drugs are insufficient. Forcandidiasis, in particular, although there have been known severaldiagnostic drugs (for example, CAND-TEC for detection of candida antigenand LABOFIT for detection of D-arabinitol),none of them gives anysatisfactory results in specificity or sensitivity.

The reasons for the delay in the development of remedies and diagnosticdrugs for mycoses as described above are that fungi causing the mycosesare eukaryotic organisms similar to the host (i.e., man) and thus notlargely different from man and that knowledge of fungi, in particular,pathogenic fungi is insufficient. Therefore it is difficult todistinguish fungi from man or to selectively kill fungi, which isresponsible for the delay in the development of drugs for mycoses.

Recently the application of genetic engineering techniques such asantisense or PCR to the treatment and diagnosis of mycoses has beenexpected. However known genes which are applicable thereto and/orproteins coded for by these genes are rare (PCT Pamphlet WO92/03455).Regarding pathogenic fungi, there have been cloned in recent years anacid protease gene, which has been assumed to participate in thepathogenicity of Candida albicans (hereinafter referred to simply as C.albicans) and Candida tropicalis (hereinafter referred to as C.tropicalis) causing candidiasis [B. Hube et al., J. Med. Vet. Mycol.,29, 129-132 (1991); Japanese Patent Laid-Open No. 49476/1993; and G.Togni et al., FEBS Letters, 286, 181-185 (1991)], a calmodulin gene ofC. albicans [S. M. Saporito et al., Gene, 106, 43-49 (1991)] and aglycolytic pathway enzyme enolase gene of C. albicans [P. Sundstrom etal., J. Bacteriology, 174, 6789-6799 (1991)]. However, each of thesegenes and proteins coded for thereby is either indistinguishable fromnonpathogenic fungi and eukaryotic organisms other than fungi or, ifdistinguishable therefrom, cannot serve as a definite action point forexhibiting any selective toxicity.

Aureobasidin [Japanese Patent Laid-Open No. 138296/1990, No. 22995/1991,No. 220199/1991 and No. 279384/1993, Japanese Patent Application No.303177/1992, J. Antibiotics, 44 (9), 919-924, ibid., 44 (9), 925-933,ibid., 44 (11), 1187-1198 (1991)] is a cyclic depsipeptide obtained as afermentation product of a strain Aureobasidium pullulans No. R106. It iscompletely different in structure from other antimycotics. As Tables 1and 2 show, aureobasidin A, which is a typical aureobasidin compound,exerts a potent antimycotic activity on various yeasts of the genusCandida including C. albicans which is a pathogenic fungus, Cryptococcusneoformans, Histoplasma capsulatum, Blastomyces dermatitidis and fungiof the genus Aspergillus (Japanese Patent Laid-Open No. 138296/1990) buthas an extremely low toxicity in mammal. Thus this compound is expectedto be useful as an antimycotic being excellent in selective toxicity.

Hereinafter, Candida, Cryptococcus and Aspergillus will be abbreviatedrespectively as C., Cr. and A.

                  TABLE 1                                                         ______________________________________                                        Test strain      TIMM No.  MIC (μg/ml)                                     ______________________________________                                        C. albicans      0136      ≦0.04                                         C. albicans var. stellatoidea 1308 ≦0.04                               C. tropicalis 0312 0.08                                                       C. kefyr 0298 0.16                                                            C. parapsilosis 0287 0.16                                                     C. krusei 0270 ≦0.04                                                   C. guilliermondii 0257 0.08                                                   C. glabrata 1062 ≦0.04                                                 Cr. neoformans 0354 0.63                                                      Cr. terreus 0424 0.31                                                         Rhodotorula rubra 0923 0.63                                                   A. fumigatus 0063 20                                                          A. clavatus 0056 0.16                                                       ______________________________________                                    

                  TABLE 2                                                         ______________________________________                                        Test strain       TIMM No.  MIC (μg/ml)                                    ______________________________________                                        A. niduians       0112      0.16                                                A. terreus 0120 5                                                             Peniciliium commune 1331 1.25                                                 Trichophyton mentagrophytes 1189 10                                           Epidermophyton floccosum 0431 2.5                                             Fonsecaea pedrosoi 0482 0.31                                                  Exophiala wemeckii 1334 1.25                                                  Cladosporium bantianum 0343 0.63                                              Histoplasma capsuiatuin 0713 0.16                                             Paracoccidioides brasiliensis 0880 0.31                                       Geotrichum candidum 0694 0.63                                                 Blastomyces dermatitidis 0126 0.31                                          ______________________________________                                    

Each of the conventional antimycotics with a weak toxicity shows only afungistatic effect, which has been regarded as a clinical problem. Incontrast, aureobasidin has a fungicidal effect. From this point of view,it has been urgently required to clarify the mechanism of the selectivetoxicity to fungi of aureobasidin. However this mechanism still remainsunknown.

Under these circumstances, the present invention aims at finding a novelprotein regulating aureobasidin sensitivity through the clarification ofthe mechanism of the selective toxicity to fungi of aureobasidin.Accordingly, the present invention aims at finding a gene coding for aprotein regulating aureobasidin sensitivity, providing a process forcloning this gene and the protein regulating aureobasidin sensitivitywhich is encoded by this gene, further providing an antisense DNA and anantisense RNA of this gene, providing a nucleic acid probe beinghybridizable with this gene, providing a process for detecting this genewith the use of the nucleic acid probe, providing a process forproducing the protein regulating aureobasidin sensitivity by using thisgene and providing an antibody against the protein regulatingaureobasidin sensitivity, and a process for detecting the proteinregulating aureobasidin sensitivity by using this antibody.

SUMMARY OF THE INVENTION

The present invention may be summarized as follows. Namely, the firstinvention of the present invention relates to an isolated gene codingfor a protein regulating aureobasidin sensitivity, that is, a generegulating aureobasidin sensitivity. The second invention relates to aprocess for cloning a gene regulating aureobasidin sensitivity which ischaracterized by using the gene regulating aureobasidin sensitivity ofthe first invention or a part thereof as a probe. The third inventionrelates to a nucleic acid probe which is hybridizable with a generegulating aureobasidin sensitivity and comprises a sequence consistingof 15 or more bases. The fourth invention relates to an antisense DNA ofa gene regulating aureobasidin sensitivity. The fifth invention relatesto an antisense RNA of a gene regulating aureobasidin sensitivity. Thesixth invention relates to a recombinant plasmid having a generegulating aureobasidin sensitivity contained therein. The seventhinvention relates to a transformant having the above-mentioned plasmidintroduced thereinto. The eighth invention relates to a process forproducing a protein regulating aureobasidin sensitivity by using theabove-mentioned transformant. The ninth invention relates to an isolatedprotein regulating aureobasidin sensitivity. The tenth invention relatesto an antibody against a protein regulating aureobasidin sensitivity.The eleventh invention relates to a process for detecting a proteinregulating aureobasidin sensitivity by using the above-mentionedantibody. The twelfth invention relates to a process for detecting agene regulating aureobasidin sensitivity by the hybridization which ischaracterized by using the nucleic acid probe of the third invention ofthe present invention. The thirteenth invention relates to a process forscreening an antimycotic by using the above-mentioned transformant or aprotein regulating aureobasidin sensitivity.

The present inventors have found out that fungi such asSchizosaccharomyces pombe (hereinafter referred to simply as Schizo.pombe) and Saccharomyces cerevisiae (hereinafter referred to simply asS. cerevisiae) and, further, mammalian cells such as mouse lymphoma EL-4cells are sensitive to aureobasidin, as Table 3 shows.

                  TABLE 3                                                         ______________________________________                                        Test strain or cell                                                                             MIC (μg/ml)                                              ______________________________________                                        Schizo. pombe     0.08                                                          S. cerevisiae 0.31                                                            mouse lymphoma EL-4 10                                                        mouse lymphoma L5178Y 100                                                     NRK-49F 12.5                                                                ______________________________________                                    

The present inventors have mutagenized a wild-type strain of Schizo.pombe or S. cerevisiae, sensitive to aureobasidin, to thereby giveresistant mutants. We have further successfully isolated a gene capableof confering aureobasidin resistance (a resistant gene) from theseresistant mutants and another gene capable of imparting aureobasidinsensitivity (a sensitive gene) from the corresponding sensitive cells.Furthermore, we have disclosed the existence of a protein encoded byeach of these genes. By culturing cells which have been transformed byintroducing the above-mentioned gene, we have succeeded in theexpression of this gene. Furthermore, we have successfully found out anovel gene regulating aureobasidin sensitivity from another fungus beingsensitive to aureobasidin by using a DNA fragment of the above-mentionedgene as a probe. In addition, we have clarified that the gene regulatingaureobasidin sensitivity is essentially required for the growth of thecells and found out that the detection of this gene or a protein whichis a gene product thereof with an antibody enables the diagnosis ofdiseases caused by these cells, for example, mycoses induced by fungi,and that an antisense DNA or an antisense RNA, which inhibits theexpression of the gene regulating aureobasidin sensitivity beingcharacteristic to the cells, is usable as a remedy for diseases causedby these cells, for example, mycoses induced by fungi, thus completingthe present invention.

That is to say, pathogenic fungi listed in Tables 1 and 2 and fungi andmammalian cells listed in Table 3, each having a sensitivity toaureobasidin, each carries a protein regulating aureobasidin sensitivityand a gene coding for this protein. The term "a protein regulatingaureobasidin sensitivity" as used herein means a protein which iscontained in an organism having a sensitivity to aureobasidin. Thisprotein is required for the expression of the sensitivity or resistanceto aureobasidin. As a matter of course, a protein having 35% or morehomology with the above-mentioned protein and having a similar functionis also a member of the protein regulating aureobasidin sensitivityaccording to the present invention. Furthermore, proteins obtained bymodifying these proteins by the genetic engineering procedure aremembers of the protein regulating aureobasidin sensitivity according tothe present invention. A gene regulating aureobasidin sensitivity meansa gene which codes for such a protein regulating aureobasidinsensitivity as those described above and involves both sensitive genesand resistant genes.

The first invention of the present invention relates to a generegulating aureobasidin sensitivity. This gene can be isolated in thefollowing manner. First, aureobasidin sensitive cells (a wild-typestrain) is mutagenized to thereby induce a resistant strain. Fromchromosome DNA or cDNA of this resistant strain, a DNA library isprepared and a gene capable of confering a resistance (a resistant gene)is cloned from this library. Then a DNA library of a wild strain isprepared and a DNA molecule being hybridizable with the resistant geneis isolated from this library and cloned. Thus a sensitive gene can beisolated.

The mutagenesis is performed by, for example, treating with a chemicalsuch as ethylmethane sulfonate (EMS) orN-methyl-N'-nitro-N-nitrosoguanidine (MNNG) or by ultraviolet or otherradiation. The cell that has acquired the resistance can be screened byculturing the mutagenized cells in a nutritional medium containingaureobasidin at an appropriate concentration under appropriateconditions. The resistant strain thus obtained may vary depending on themethod and conditions selected for the mutagenesis. Also, strainsdiffering in the extent of resistance from each other can be separatedby changing the aureobasidin concentration or a temperature-sensitiveresistant strain can be isolated by changing the temperature in the stepof screening. There are a number of mechanisms of resistance toaureobasidin. Accordingly, a number of resistant genes can be isolatedby genetically classifying these various resistant strains. In the caseof a yeast, the classification may be performed by the complementationtest. Namely, resistant strains are prepared from haploid cells. Next,diploid cells can be obtained by crossing resistant strains differing inmating type from each other. Then spores formed from these diploids areexamined by the tetrad analysis.

As typical examples of the genes regulating aureobasidin sensitivity(named aur) according to the present invention, aur1 and aur2 genes maybe cited. Typical examples of the aur1 gene include spaur1 gene isolatedfrom Schizo. pombe and scaur1 gene isolated from S. cerevisiae, whiletypical examples of the aur2 gene include scaur2 gene isolated from S.cerevisiae. Now, resistant genes (spaur1^(R), scaur1^(R) and scaur2^(R))isolated from resistant mutants by the present inventors and sensitivegenes (spaur1^(S), scaur1^(S) and scaur2^(S)) isolated from sensitivewild-type strains will be described.

FIG. 1 shows a restriction enzyme map of the genes spaur1^(R) andspaur1^(S) regulating aureobasidin sensitivity, FIG. 2 shows arestriction enzyme map of scaur1^(R) and scaur1^(S) and FIG. 3 shows arestriction enzyme map of scaur2^(R) and scaur2^(S).

Schizo. pombe, which is sensitive to aureobasidin, is mutagenized withEMS and a genomic library of the resistant strain thus obtained isprepared. From this library, a DNA fragment containing a resistant gene(spaur1^(R)) and having the restriction enzyme map of FIG. 1 isisolated. This gene has a nucleotide sequence represented by SEQ ID No.1 in Sequence Listing. The amino acid sequence of a protein encoded bythis gene, which is deduced on the basis of this nucleotide sequence, isthe one represented by SEQ ID No. 2 in Sequence Listing. By thehybridization with the use of this resistant gene as a probe, a DNAfragment containing a sensitive gene (spaur1^(S)) and having therestriction enzyme map of FIG. 1 is isolated from a sensitive strain.This gene has a nucleotide sequence represented by SEQ ID No. 3 inSequence Listing. The amino acid sequence of a protein encoded by thisgene, which is estimated on the basis of this nucleotide sequence, isthe one represented by SEQ ID No. 4 in Sequence Listing. A comparisonbetween the sequences of SEQ ID No. 3 and SEQ ID No. 1 reveals that amutation from G to T occurs at the base at the position 1053, while acomparison between the sequences of SEQ ID No. 4 and SEQ ID No. 2reveals that glycine at the residue 240 is converted into cysteine atthe amino acid level, thus giving rise to the resistance.

Also, S. cerevisiae, which is sensitive to aureobasidin, is mutagenizedwith EMS and genomic libraries of two resistant strains thus obtainedare prepared. From one of these libraries, a DNA fragment containing aresistant gene (scaur1^(R)) as a dominant mutant and having therestriction enzyme map of FIG. 2 is isolated, while a DNA fragmentcontaining a resistant gene (scaur2^(R)) and having the restrictionenzyme map of FIG. 3 is isolated from another library.

The nucleotide sequence of the coding region for the protein of thescaur1^(R) gene is the one represented by SEQ ID No. 5 in SequenceListing. The amino acid sequence of the protein encoded by this gene,which is estimated on the basis of the above nucleotide sequence, is theone represented by SEQ ID No. 6 in Sequence Listing. By thehybridization with the use of this resistant gene scaur1^(R) as a probe,a DNA fragment containing a sensitive gene (scaur1^(S)) and having therestriction enzyme map of FIG. 2 is isolated from a sensitive strain.This gene has a nucleotide sequence represented by SEQ ID No. 7 inSequence Listing. The amino acid sequence of a protein encoded by thisgene, which is estimated on the basis of this nucleotide sequence, isthe one represented by SEQ ID No. 8 in Sequence Listing. A comparisonbetween the sequences of SEQ ID No. 7 and SEQ ID No. 5 reveals that amutation from T to A occurs at the base at the position 852, while acomparison between the sequences of SEQ ID No. 8 and SEQ ID No. 6reveals that phenylalanine at the residue 158 is converted into tyrosineat the amino acid level, thus giving rise to the resistance. The spaur1gene has a 58% homology with the scaur1 gene at the amino acid level.Thus it is obvious that they are genes coding for proteins havingsimilar functions to each other. When genes and proteins beinghomologous in sequence with the spaur1 and scaur1 genes and with theproteins encoded thereby are searched from a data base, none having ahomology of 35% or above is detected. Accordingly, it is clear thatthese genes and the proteins encoded thereby are novel molecules whichhave never been known hitherto. By the hybridization with the use of theDNA fragment of the resistant gene scaur2^(R) as a probe, a DNA fragmentcontaining a sensitive gene (scaur2^(S)) and having the restrictionenzyme map of FIG. 3 is isolated from a sensitive strain.

The nucleotide sequence of this sensitive gene is the one represented bySEQ ID No. 9 in Sequence Listing and the amino acid sequence of theprotein encoded by this gene, which is estimated on the basis of thisnucleotide sequence, is the one represented by SEQ ID No. 10 in SequenceListing. As the result of the homology search with the scaur2^(S) geneand the protein encoded thereby, it has been found out that cysticfibrosis transmembrane conductance regulator (CFTR) of mammals alone hasa homology as low as 31%. Compared with this CFTR, however, the parthaving a high homology is limited to the region around the domain of thenucleotide binding. It is therefore obvious that the protein encoded bythe scaur2^(S) gene is a protein which is completely different from CFTRin function and has never been known hitherto.

In order to prove the importance of the aur1 gene in the growth ofcells, genes for disrupting the aur1 as shown in FIG. 4 and FIG. 5, inwhich genes coding for orotidine-5'-phosphate decarboxylase (ura4⁻ inthe case of Schizo. pombe, while URA3 in the case of S. cerevisiae) havebeen introduced midway in the aur1 gene, are prepared. When these aur1disrupted genes are introduced into Schizo. pombe and S. cerevisiaerespectively, the cells having the aur1 disrupted genes cannot grow atall. Thus it has been revealed that these genes and the proteins encodedthereby are essentially required for the growth of the yeast cells.

As the above examples clearly show, a gene regulating aureobasidinsensitivity can be isolated by using a organism having sensitivity toaureobasidin as a starting material and by carrying out the cloning withthe use of various mutagenesis methods and/or screening methodsdepending on the organisms or the methods. Also, genes beinghybridizable with the above-mentioned genes are involved in the scope ofthe first invention of the present invention. A gene regulatingaureobasidin sensitivity can be isolated by the following method. Thegenomic DNA library of an organism having sensitivity to aureobasidin isintegrated into, for example, a high-expression vector of a yeast andtransformed into the yeast. Then a clone having aureobasidin resistanceis selected from the transformants and DNA is recovered from this clone.Thus the resistant gene can be obtained. As a matter of course, genesobtained by modifying some part of the gene regulating aureobasidinsensitivity thus obtained by some chemical or physical methods areinvolved in the scope of the first invention of the present invention.

The second invention of the present invention relates to a process forcloning a gene regulating aureobasidin sensitivity which ischaracterized by using the gene regulating aureobasidin sensitivity ofthe first invention of the present invention or a part thereof as aprobe. Namely, by screening by the hybridization method or thepolymerase chain reaction (PCR) method with the use of a part(consisting of at least 15 oligonucleotides) or the whole of the gene asobtained above, a gene coding for a protein having a similar functioncan be isolated.

For example, a pair of primers of SEQ ID No. 11 and SEQ ID No. 12 inSequence Listing are synthesized on the basis of the DNA nucleotidesequence of the spaur1^(R) gene represented by SEQ ID No. 1. Then PCR isperformed by using cDNA of C. albicans, which is a pathogenic fungus, asa template with the use of the above-mentioned primers. The PCR iscarried out and the PCR products are electrophoresed on an agarose geland stained with ethidium bromide. In FIG. 6, the lanes 1, 2 and 3 showthe results obtained by using cDNA of C. albicans, cDNA of S. cerevisiaeand cDNA of Schizo. pombe as a template, respectively. As shown in FIG.6, a certain DNA fragment is specifically amplified.

By screening the genomic DNA library of C. albicans with the use of thisDNA fragment as a probe, a DNA molecule having a gene (caaur1), whichhas the same function as that of the spaur1 and scaur1 genes and havingthe restriction enzyme map of FIG. 7 is obtained. The nucleotidesequence of this caaur1 gene is the one represented by SEQ ID No. 13 inSequence Listing and the amino acid sequence of the protein encoded bythis gene, which has been estimated on the basis of the above nucleotidesequence, is the one represented by SEQ ID No. 14 in Sequence Listing.It has a high homology with the proteins encoded by the spaur1 andscaur1 genes.

By screening the genomic DNA library of C. albicans with the use of aDNA fragment comprising the whole length or a part of the scaur2^(S)gene represented by SEQ ID No. 9 in Sequence Listing as a probe, a DNAfragment containing gene (caaur2), which has the same function as thatof the scaur2 gene, and having the restriction enzyme map of FIG. 8 isobtained. The nucleotide sequence of a part of this caaur2 gene is theone represented by SEQ ID No. 15 in Sequence Listing and the amino acidsequence of the region encoded by this gene, which has been estimated onthe basis of this nucleotide sequence, is the one represented by SEQ IDNo. 16 in Sequence Listing. It has a high homology with thecorresponding region of the protein encoded by the scaur2 gene.

The third invention of the present invention relates to anoligonucleotide comprising 15 or more bases which serves as theabove-mentioned nucleic acid probe and is hybridizable with the generegulating aureobasidin sensitivity, for example, the DNA fragmenthaving the restriction enzyme map as shown in FIG. 1, FIG. 2 or FIG. 3.This nucleic acid probe is usable in, for example, the hybridization insitu, the identification of a tissue wherein the above-mentioned genecan be expressed, and the confirmation of the presence of a gene or mRNAin various vital tissues. This nucleic acid probe can be prepared byligating the above-mentioned gene or a gene fragment to an appropriatevector, introducing it into a bacterium, allowing it to replicate in thebacterium, extracting from a disrupted cell suspension, cleaving with arestriction enzyme capable of recognizing the vector-ligating site,electrophoresing and then excising from the gel. Alternatively, thisnucleic acid probe can be constructed by the chemical synthesis with theuse of a DNA synthesizer or gene amplification techniques by PCR on thebasis of the nucleotide sequence of SEQ ID. Nos. 1, 3, 5, 7, 9, 13, 15or 21 in Sequence Listing. This nucleic acid probe can be labeled with aradioisotope or a fluorescent substance to thereby elevate the detectionsensitivity at the use.

The fourth invention of the present invention relates to an antisenseDNA of the above-mentioned gene regulating aureobasidin sensitivity,while the fifth invention of the present invention relates to anantisense RNA thereof. The introduction of the antisense DNA orantisense RNA into cells makes it possible to control the expression ofthe gene regulating aureobasidin sensitivity.

As examples of the antisense DNA to be introduced, antisense DNAscorresponding to the genes regulating aureobasidin sensitivity of SEQ IDNos. 1, 3, 5, 7, 9, 13, 15 or 21 in Sequence Listing and some partsthereof may be cited. SEQ ID No. 17 in Sequence Listing shows an exampleof this antisense DNA. It represents the sequence of an antisense DNA ofthe gene regulating aureobasidin sensitivity of SEQ ID No. 1 in SequenceListing. A fragment obtained by appropriately cleaving some part of suchan antisense DNA, and a DNA synthesized depending on such an antisenseDNA sequence may be used as the antisense DNA.

As examples of the antisense RNA to be introduced, antisense RNAscorresponding to the genes regulating aureobasidin sensitivity of SEQ IDNos. 1, 3, 5, 7, 9, 13, 15 or 21 in Sequence Listing and some partsthereof may be cited. SEQ ID No. 18 in Sequence Listing shows an exampleof this antisense RNA. It represents the sequence of an antisense RNA ofthe gene regulating aureobasidin sensitivity of SEQ ID No. 1 in SequenceListing. A fragment obtained by appropriately cleaving some part of suchan antisense RNA, an RNA synthesized depending on such an antisense RNAsequence, and an RNA prepared with RNA polymerase in an in vitrotranscription system by using the DNA corresponding to the generegulating aureobasidin sensitivity of SEQ ID No. 1 or SEQ ID No. 3 inSequence Listing or a part thereof may be used as the antisense RNA.

These antisense DNA and antisense RNA may be chemically modified so asto prevent degradation in vivo or to facilitate passage through a cellmembrane. A substance capable of inactivating mRNA, for example,ribozyme may be linked thereto. The antisense DNA and antisense RNA thusprepared are usable in the treatment of various diseases such as mycosesaccompanied by an increase in the amount of mRNA coding for a proteinregulating aureobasidin sensitivity.

The sixth invention of the present invention relates to a recombinantplasmid having a gene coding for a protein regulating aureobasidinsensitivity being integrated into an appropriate vector. For example, aplasmid, in which a gene regulating aureobasidin sensitivity gene hasbeen integrated into an appropriate yeast vector, is highly useful as aselection marker gene, since a transformant can be easily selectedthereby with the guidance of the chemical resistance by usingaureobasidin.

Also, the recombinant plasmid can be stably carried by, for example,Escherichia coli. Examples of vectors which are usable in this caseinclude pUC118, pWH5, pAU-PS, Traplexll9 and pTV118. pAU-PS having thespaur1^(S) gene integrated therein is named pSPAR1. pWH5 having thespaur1^(S) gene integrated therein is named pSCAR1. pWH5 having thescaur2^(R) gene integrated therein is named pSCAR1. Traplexll9 vectorhaving the caaur1 gene integrated therein is named pCAAR1. pTV118 vectorhaving a part of the caaur2 gene integrated therein is named pCAAR2N.Each of these recombinant plasmids is transformed into E. coli. It isalso possible to express these plasmids in an appropriate host. Such agene is reduced exclusively into the open reading frame (ORF) to betranslated into a protein by cleaving with an appropriate restrictionenzyme, if necessary, and then bound to an appropriate vector. Thus anexpression recombinant plasmid can be obtained. When E. coli is used asthe host, plasmids such as pTV118 may be used as a vector for theexpression plasmid. When a yeast is used as the host, plasmids such aspYES2 may be used as the vector. When mammalian cells are used as thehost, plasmids such as pMAMneo may be used as the vector.

The seventh invention of the present invention relates to a transformanthaving the above-mentioned recombinant plasmid which has been introducedinto an appropriate host. As the host, E. coli, yeasts and mammaliancells are usable. E. coli JM109 transformed by pSPAR1 having thespaur1^(S) gene integrated therein has been named and designated asEscherichia coli JM109/pSPAR1 and deposited at National Institute ofBioscience and Human-Technology, Agency of Industrial Science andTechnology (1-3, Higashi 1 chome Tsukuba-shi Ibaraki-ken 305, JAPAN), inaccordance with the Budapest Treaty under the accession number FERMBP-4485. E. coli HB101 transformed by pSCAR1 having the scaur1^(S) geneintegrated therein has been named and designated as Escherichia coliHB101/pSCAR1 and deposited at National Institute of Bioscience andHuman-Technology, Agency of Industrial Science and Technology inaccordance with the Budapest Treaty under the accession number FERMBP-4483. E. coli HB101 transformed by pSCAR2 having the scaur2^(R) geneintegrated therein has been named and designated as Escherichia coliHB101/pSCAR2 and deposited at National Institute of Bioscience andHuman-Technology, Agency of Industrial Science and Technology inaccordance with the Budapest Treaty under the accession number FERMBP-4484. E. coli HB101 transformed by pCCAR1 having the caaur1^(S) geneintegrated therein has been named and designated as Escherichia coliHB101/pCAAR1 and deposited at National Institute of Bioscience andHuman-Technology, Agency of Industrial Science and Technology inaccordance with the Budapest Treaty under the accession number FERMBP-4482. E. coli HB101 transformed by pCAAR2N having a part of thecaaur2 gene integrated therein has been named and designated asEscherichia coli HB101/pCAAR2N and deposited at National Institute ofBioscience and Human-Technology, Agency of Industrial Science andTechnology in accordance with the Budapest Treaty under the accessionnumber FERM BP-4481.

A transformant capable of expressing a protein regulating aureobasidinsensitivity can be obtained by transforming a expression recombinantplasmid into an appropriate host, as described above. For example, ayeast having a recombinant plasmid as shown in FIG. 9 introducedthereinto is usable for this purpose.

The eighth invention of the present invention relates to a process forproducing a protein regulating aureobasidin sensitivity which comprisesincubating a transformant according to the sixth invention of thepresent invention, which contains a gene coding for this protein, in anappropriate nutritional medium, allowing the expression of the protein,then recovering the protein from the cells or the medium and purifyingthe same. For the expression of the gene coding for this protein, E.coli, a yeast or mammalian cells are employed as a host. When the yeasthaving the recombinant plasmid of FIG. 9 is incubated in a mediumcontaining galactose, for example, the protein regulating aureobasidinsensitivity which is encoded by the scaur1^(S) gene can be expressed.

The ninth invention of the present invention relates to an isolatedprotein regulating aureobasidin sensitivity. As examples of such aprotein, those encoded by the above-mentioned spaur1, scaur1, scaur2,caaur1 and caaur2 genes can be cited.

The spaur1^(S) gene codes for a protein having an amino acid sequencerepresented by SEQ ID No. 4 in Sequence Listing, while the scaur1^(S)gene codes for a protein having an amino acid sequence represented bySEQ ID No. 8 in Sequence Listing. By the northern hybridization with theuse of a DNA fragment of the spaur1 gene as a probe, mRNAs are detectedfrom a sensitive strain (FIG. 10). Thus the expression of the spaur1gene is confirmed.

FIG. 10 is an autoradiogram showing the results of the northernhybridization wherein mRNAs obtained from cells of a sensitive strain ofSchizo. pombe in the logarithmic growth phase (lane 1), cells of aresistant strain in the logarithmic growth phase (lane 2), cells of thesensitive strain in the stationary phase (lane 3) and cells of theresistant strain in the stationary phase (lane 4) are electrophoresed ona 1.2% agarose gel containing formaldehyde.

The tenth invention of the present invention relates to an antibodyagainst the above-mentioned protein regulating aureobasidin sensitivity.For example proteins having amino acid sequences of SEQ ID Nos. 2, 4, 6,8, 10, 14, 16 or 22 in Sequence Listing and peptides comprising someparts of these amino acid sequences may be used as an antigen. Theformer antigens can be prepared through the expression in a transformantfollowed by purification, while the latter antigens can be synthesizedon, for example, a marketed synthesizer. The antibody is produced by theconventional method. For example, an animal such as a rabbit isimmunized with the above-mentioned protein or a peptide fragmenttogether with an adjuvant to thereby give a polyclonal antibody. Amonoclonal antibody can be produced by fusing antibody-producing Bcells, which have been obtained by immunizing with an antigen, withmyeloma cells, screening hybridomas producing the target antibody, andincubating these cells. As will be described hereinafter, theseantibodies are usable in the treatment and diagnosis for animal andhuman diseases in which the above-mentioned proteins participate, suchas mycoses.

For example, a peptide corresponding to the part of the 103- to113-positions in the amino acid sequence of SEQ ID No. 8 is synthesizedon a synthesizer and then bound to a carrier protein. Then a rabbit isimmunized therewith and thus a polyclonal antibody is obtained. In thepresent invention, keyhole limpet hemocyanin (KLH) is used as thecarrier protein. Alternatively, bovine serum albumin and ovalbumin areusable therefor.

The eleventh invention of the present invention relates to a process fordetecting a protein regulating aureobasidin sensitivity by using theabove-mentioned antibody. The detection can be carried out by detectingthe binding of the antibody to the protein or measuring the amount ofbinding. For example, the protein or the cells producing the same can bedetected by treating with a fluorescence-labeled antibody and thenobserving under a fluorescence microscope. The amount of the antibodybound to the protein can be measured by various known methods. Forexample, S. cerevisiae cells are stained by the immunofluorescentantibody technique by using the above-mentioned antibody and a secondaryantibody such as FITC-labeled antirabbit antibody. Thus it is clarifiedthat the protein encoded by the scaur1 gene is distributed all over thecells. Further, a yeast having the recombinant plasmid of FIG. 9introduced thereinto is incubated in a medium containing glucose orgalactose. The cells thus obtained are disrupted with glass beads andproteins are solubilized. Then these proteins are separated bySDS-polyacrylamide gel electrophoresis (SDS-PAGE) and the westernblotting is carried out in the conventional manner by using theabove-mentioned polyclonal antibody and peroxidase-labeled anti-rabbitantibody. Consequently, the protein encoded by the scaur1 gene can bedetected, as FIG. 11 shows.

FIG. 11 shows the results of the western blotting wherein the proteinsprepared from the cells obtained by the incubation in the presence ofglucose (lane 1) or galactose (lane 2) are subjected to SDS-PAGE. A mainband binding to the polyclonal antibody of the present invention isdetected at around 38 kDa.

The twelfth invention of the present invention relates to a process fordetecting a gene regulating aureobasidin sensitivity, for example, mRNAat the expression of a protein, by using the above-mentionedoligonucleotide as a nucleic acid probe. This process is applicable tothe diagnosis for various diseases, including mycoses, associated withan abnormal amount of mRNA coding for the protein. For example, nucleicacids are precipitated from disrupted cells and mRNA is hybridized witha radioisotope-labeled nucleic acid probe on a nitrocellulose membrane.The amount of binding can be measured by autoradiography (FIG. 10) orwith a scintillation counter.

The thirteenth invention of the present invention relates to a processfor efficient screening of a novel antimycotic by using the transformantof the seventh invention of the present invention or the proteinregulating aureobasidin sensitivity of the ninth invention of thepresent invention. For example, a drug exerting its effect on theprotein or the gene of the present invention can be efficiently foundout through a comparison of the activity on a transformant containing asensitive gene with the activity on a transformant containing aresistant gene or a comparison between the activities on transformantsdiffering in expression level from each other. Also, the screening canbe efficiently carried out by measuring the affinity for the protein ofthe present invention, for example, the activity of inhibiting thebinding of radiolabeled-aureobasidin to the protein.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a restriction enzyme map of the genes spaur1^(R) andspaur1^(S) regulating aureobasidin sensitivity.

FIG. 2 is a restriction enzyme map of scaur1^(R) and scaur1^(S).

FIG. 3 is a restriction enzyme map of scaur2^(R) and scaur2^(S).

FIG. 4 shows the structure of a DNA for disrupting the Schizo. pombespaur1^(S) gene.

FIG. 5 shows the structure of a DNA for disrupting the S. cerevisiaescaur1^(S) gene.

FIG. 6 shows the results of the detection of the aur1 gene caaur1carried by C. albicans by the PCR method.

FIG. 7 is a restriction enzyme map of the caaur1 gene carried by C.albicans.

FIG. 8 is a restriction enzyme map of the caaur2 gene.

FIG. 9 shows the structure of a plasmid YEpSCARW3 for expressing thescaur1 gene.

FIG. 10 shows the results of the northern hybridization of the spaur1gene of Schizo. pombe.

FIG. 11 shows the results of the detection of the scaur1 protein byusing an antibody.

FIG. 12 is a restriction enzyme map of pAR25.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS(S) Examples

To further illustrate the present invention in greater detail, thefollowing Examples will be given. However it is to be understood thatthe present invention is not restricted thereto.

Example 1

Cloning of a gene regulating aureobasidin sensitivity originating infission yeast Schizo. pombe

1-a) Separation of aureobasidin-resistant mutant of Schizo. pombe

About 1×10⁸ cells of a Schizo. pombe haploid cell strain JY745 (matingtype h⁻, genotype ade6-M210, leu1, ura4-D18) exhibiting a sensitivity toaureobasidin at a concentration of 0.08 μg/ml were suspended in 1 ml ofa phosphate buffer containing 0.9% NaCl. Then the cells were mutagenizedwith EMS at a final concentration of 3% at 30° C. for 90 minutes. Afterneutralizing by adding 8 ml of 5% sodium thiosulfate, the cells thustreated were harvested by centrifugation (2500 r.p.m., 5 minutes),washed twice with 6 ml of physiological saline and then suspended in 2ml of a YEL medium (3% of glucose, 0.5% of yeast extract). Thesuspension was incubated at 30° C. for 5 hours under stirring and thenspreaded on a YEA plate (the YEL medium containing 1.5% of agar)containing 5 μg/ml of aureobasidin A. After incubating at 30° C. for 3to 4 days, two or three aureobasidin-resistant colonies were formed per1×10⁸ cells. After carrying out the mutagenesis several times, fiveclone mutants, i.e., THR01, THR04, THR05, THR06 and THR07 were obtained.These mutants were resistant to more than 25 μg/ml of aureobasidin A butthe same as the parent strain in the sensitivity to cycloheximide andamphotericin B. Therefore it is estimated that they are not mutantshaving a multiple drug resistance but ones having a resistance specificto aureobasidin.

1-b) Genetic analysis

Each of the above-mentioned resistant strains THR01, THR04, THR05, THR06and THR07 was crossed with normal cells of Schizo. pombe LH121 strain(mating type h⁺, genotype ade6-M216, ura4-D18) differing in mating type.Diploid cells obtained were examined about the resistance toaureobasidin. Similar to the resistant strains, the five diploids formedby crossing the resistant strains with the normal one were resistant to25 μg/ml of aureobasidin A, thus proving that these resistant mutationswere dominant. To perform the tetrad analysis, the above-mentioneddiploids were subsequently inoculated on an MEA medium (3% of maltextract, 2.5% of agar) for sporulation and incubated at 25° C. for 2days. Prior to the meiosis, the diploid cells replicated DNA on the MEAmedium and then underwent the meiosis to form asci each containing fourascospores of the haploid. These spores were separated with amicromanipulator and allowed to germinate on the YEA plate, followed bythe formation of colonies. Then the resistance to aureobasidin of thesecolonies was examined. Among four spores contained in an ascus, theseparation of the sensitivity versus the resistance showed 2:2. Thisresult indicates that the aureobasidin resistant mutation was induced bya mutation in single gene. Further, the complementation test wasperformed in order to confirm whether the resistant genes of theabove-mentioned five mutants were identical with each other or not. Forexample, a mutant of the mating type h⁺, which had been obtained bycrossing the mutant THR01 with the LH121 strain in the above tetradanalysis, was crossed with another variant THR04 (mating type h³¹) onthe MEA plate as described above and, after sporulation, the tetradanalysis was carried out. As a result, all of the colonies formed fromfour ascospores showed resistance to aureobasidin, which indicates thatthe mutational genes of THR01 and THR04 are the same with each other.Similarly, the five mutants were examined and it was thus found out thatall mutations occurred on the same gene. This gene regulatingaureobasidin sensitivity is named spaur1, the normal gene (sensitivegene) is named spaur1^(S) and the mutational gene (resistant gene) isnamed spaur1^(R).

1-c) Preparation of genomic library of aureobasidin resistant strain

Genomic DNA was extracted and purified from the aureobasidin resistantstrain THR01 by the method of P. Philippsen et al. [Methods inEnzymology, 194, 169-175 (1991)]. The purified genomic DNA (8 μg) waspartially digested by treating with 5 U of a restriction enzyme HindIIIat 37° C. for 10 minutes, deproteinized with phenol/chloroform andprecipitated with ethanol. The partially digested DNA waselectrophoresed on a 0.8% agarose gel and DNA in the region of 3 to 15kb was extracted. The DNA thus obtained was ligated with a yeast-E. colishuttle vector pAU-PS (2 μg) which had been completely digested withHindIII by using a DNA ligation kit (manufactured by Takara Shuzo Co.,Ltd.) and then transformed into E. coli HB101. Thus a genomic library ofthe aureobasidin resistant strain was formed. E. coli containing thisgenomic library was incubated in 50 ml of an LB medium (1% of bactotrypton, 0.5% of bacto yeast extract, 0.5% of sodium chloride)containing 100 μg/ml of ampicillin and 25 μg/ml of tetracycline at 37°C. overnight. Then a plasmid was recovered and purified from the E. colicells.

1-d) Expression and cloning of aureobasidin resistant gene spaur1^(R)

The plasmid originating in the genomic library of the aureobasidinresistant strain as prepared above was transformed into a strain Schizo.pombe JY745 by the method of Okazaki et al. [Nucleic Acid Research, 18,6485-6489 (1990)]. The transformed cells were spreaded on a minimummedium SD plate [0.67% of yeast nitrogen base without amino acids(manufactured by Difco), 2% of glucose, 2% of agar] containing 75 μg/mlof adenine sulfate and 50 μg/ml of leucine. After incubating at 30° C.for 3 to 4 days, the colonies thus formed were replicated onto an SDplate containing 5 μg/ml of aureobasidin A, 75 μg/ml of adenine sulfateand 50 μg/ml of leucine. It is conceivably that a colony propagated onthis plate may have the plasmid containing the aureobasidin resistantgene. This colony was inoculated into .5 ml of a liquid SD mediumcontaining 75 μg/ml of adenine sulfate and 50 μg/ml of leucine. Afterincubating at 30° C. for 2 days, the plasmid was recovered from thepropagated cells by the method of I. Hagan et al. [J. Cell Sci., 91,587-595 (1988)] . Namely, the cells were harvested from the culture (5ml) by centrifugation and then suspended in 1.5 ml of 50 mMcitrate/phosphate buffer containing 1.2 M of sorbitol and 2 mg/ml ofZymolyase. Then the suspension was maintained at 37° C. for 60 minutes.The cells were collected by centrifuging at 3,000 r.p.m. for 30 secondsand suspended in 300 μl of a TE [10 mM of Tris-HCl, pH 8, 1 mM of EDTA]solution. After adding 35 μl of 10% SDS, the mixture was maintained at65° C. for 5 minutes. After adding 100 μl of 5 M potassium acetate, themixture was allowed to stand in ice for 30 minutes. Then it wascentrifuged at 10,000 r.p.m. at 4° C. for 10 minutes and a plasmid DNAwas purified from the supernatant by using EASYTRAP™ (manufactured byTakara Shuzo Co., Ltd.).

This plasmid was transformed into E. coli HB101 and a plasmid DNA wasprepared from E. coli colonies formed on an LB medium containingampicillin and tetracycline. This plasmid, which contained a DNA of 4.5kb, was named pAR25. FIG. 12 shows the restriction enzyme map of the DNAof 4.5 kb in pAR25. To specify the gene region, HindIII fragments orSacI fragments of various sizes were subcloned into the pAU-PS vector.These DNAs were transformed into normal JY745 cells by theabove-mentioned method of Okazaki et al. and the acquisition ofaureobasidin resistance was examined. As a result, it is revealed that aHindIII-SacI 2.4 kb DNA fragment contains the spaur1^(R) gene. Therestriction enzyme map of this DNA segment containing the aureobasidinresistant gene spaur1^(R) is shown in FIG. 1. This fragment was clonedinto a pUC118 vector (named pUARS2R) and then the DNA nucleotidesequence was identified (SEQ ID No. 1 in Sequence Listing). From thisnucleotide sequence, it is revealed that the spaur1^(R) gene code for aprotein having an amino acid sequence represented by SEQ ID No. 2 inSequence Listing.

1-e) Cloning of aureobasidin sensitive gene spaur1^(S)

By the same method as the one employed in the above c), genomic DNA wasextracted and purified from normal cells. After partially digesting withHindIII, a genomic library of the normal cells was constructed. An E.coli stock containing this library DNA was spreaded on an LB agar mediumcontaining ampicillin and tetracycline and incubated overnight at 37° C.The colonies thus formed were transferred onto a nylon membrane(Hybond™-N, manufactured by Amersham) and the colony hybridization wasperformed.

As a probe, the above-mentioned DNA fragment (2.4 kb) obtained bycleaving the spaur1^(R) gene with HindIII-SacI and labeled with [α-³² p]dCTP by using a random primer DNA labeling kit (manufactured by TakaraShuzo Co., Ltd.) was used. As the results of screening of 5×10⁴colonies, five clones being hybridizable with the probe were obtained.Plasmids were purified from E. coli cells of these five clones. As theresult of the cleavage with restriction enzymes, it was found out thatall of these clones contained the same DNA fragment of 4.5 kb (namedpARN1). The restriction enzyme map of the DNA of 4.5 kb in pARN1 wasidentical with that of pAR25 shown in FIG. 10. Therefore, a HindIII-SacI2.4 kb DNA fragment which was a region containing the spaur1^(S) genewas prepared from pARN1. Then it was cloned into the pAU-PS vector andthis plasmid was named pSPAR1.

By using this plasmid pSPAR1, a strain E. coli JM109 was transformed andthe transformant thus obtained was named and designated as Escherichiacoli JM109/pSPAR1. It has been deposited at National Institute ofBioscience and Human-Technology, Agency of Industrial Science andTechnology in accordance with the Budapest Treaty under the accessionnumber FERM BP-4485. This DNA fragment containing the aureobasidinsensitive gene spaur1^(S) had the restriction enzyme map shown in FIG. 1and the DNA nucleotide sequence thereof was the one represented by SEQID No. 3 in Sequence Listing. Based on this nucleotide sequence, it hasbeen revealed that the spaur1^(S) gene codes for a protein having theamino acid sequence represented by SEQ ID No. 4 in Sequence Listing and,when compared with the resistant gene spaur1^(R), the amino acid at theresidue 240 has been changed from glycine into cysteine.

Example 2

Cloning of aureobasidin sensitive genes scaur1 and scaur2 originating inbudding yeast S. cerevisiae

2-a) Separation of aureobasidin resistant mutant of S. cerevisiae

A strain S. cerevisiae DKD5D (mating type a, genotype leu2-3 112, trp1,his3) having a sensitivity to aureobasidin at a concentration of 0.31μg/ml was mutagenized with EMS in the same manner as the one employed inthe case of Schizo. pombe. Then resistant mutants were separated on anagar plate of a complete nutritional medium YPD (1% of yeast extract, 2%of polypeptone, 2% of glucose) containing 5 μg/ml or 1.5 μg/ml ofaureobasidin A. After repeating the mutagenesis several times, 34 mutantclones were obtained. These mutants were resistant to more than 25 μg/mlof aureobasidin A and estimated as having not a multiple drug resistancemutation but an aureobasidin-specific resistance mutation.

2-b) Genetic analysis

Similar to the above-mentioned case of Schizo. pombe, the geneticanalysis using the tetrad analysis and the complementation test wasperformed. As a result, the genes could be classified into two types.These genes regulating aureobasidin sensitivity were named scaur1 andscaur2, the resistant genes isolated from the resistant mutant werenamed scaur1^(R) and scaur2R, and the sensitive genes isolated from thesensitive wild-type strain were named scaur1^(S) and scaur2^(S),respectively.

The R94A strain had a gene with dominant mutation (scaur1^(R)). It hasbeen further clarified that the scaur1 gene is located in theneighborhood of the met14 gene of the eleventh chromosome.

2-c) Preparation of genomic library of aureobasidin resistant strainhaving aureobasidin resistant gene scaur1^(R)

Genomic DNA was extracted and purified from the aureobasidin resistantstrain R94A by the above-mentioned method of P. Philippsen et al. Thepurified genomic DNA (8 μg) was partially digested by treating with 5 Uof a restriction enzyme HindIII at 37° C. for 10 minutes, deproteinizedwith phenol/chloroform and precipitated with ethanol. The partiallydigested DNA thus obtained was electrophoresed on a 0.8% agarose gel andDNA in the region of 3 to 15 kb was extracted. The DNA thus obtained wasligated with a yeast-E. coli shuttle vector pWH5 (2 μg) which had beencompletely digested with HindIII by using a DNA ligation kit and thentransformed into E. coli HB101. Thus a genomic library was formed. E.coli containing this genomic library was cultured in 50 ml of an LBmedium containing ampicillin and tetracycline at 37° C. overnight. Thena plasmid was recovered and purified from the E. coli cells.

2-d) Expression and cloning of aureobasidin resistant gene scaur1^(R)

The above-mentioned genomic library of the R94A strain was transformedinto S. cerevisiae SH3328 (mating type α, genotype ura3-52, his4, thr4,leu2-3•112) in accordance with the method of R. H. Schiestl et al.[Current Genetics, 16, 339-346 (1989)]. The transformed cells werespread on a minimum medium SD plate [0.67% of yeast nitrogen basewithout amino acids, 2% of glucose, 2% of agar] containing 25 μg/ml ofuracil, 35 μg/ml of histidine and 500 μg/ml of threonine. Afterincubating at 30° C. for 3 to 4 days, the colonies thus formed werereplicated onto a YPD agar plate containing 1.5 μg/ml of aureobasidin A.A colony thus formed was inoculated into 5 ml of a liquid YPD medium.After incubating at 30° C. for 2 days, a plasmid DNA was recovered fromthe propagated cells by the above-mentioned method of I. Hagan et al.This plasmid was transformed into a yeast again and it was confirmedthat the obtained transformant had acquired aureobasidin resistance.This plasmid, which contained a DNA of 3.5 kb, was named pWTCR3. Neitherthe DNA fragment of 2.0 kb nor the DNA fragment of 1.5 kb obtained bycleaving with HindIII exhibited any aureobasidin resistant activityalone. Thus it is confirmed that the gene is contained in the DNAfragment of 3.5 kb. FIG. 2 shows the restriction enzyme map of this DNAfragment of 3.5 kb containing the aureobasidin resistant genescaur1^(R). The HindIII fragments of 1.5 kb and 2 kb were each clonedinto pUC118, followed by the determination of the DNA nucleotidesequence (SEQ ID No. 5 in Sequence Listing). From this nucleotidesequence, it has been revealed that the scaur1^(R) gene codes for aprotein having an amino acid sequence represented by SEQ ID No. 6 inSequence Listing.

2-e) Cloning of aureobasidin sensitive gene scaur1^(S) corresponding toaureobasidin resistant gene scaur1^(R)

By the same method as the one employed in the above Example 2-c),genomic DNA was extracted and purified from the parent strain S.cerevisiae DKD5D. After partially digesting with HindIII, the DNA wasligated with pWH5 and transformed into E. coli HB101. Thus a genomiclibrary of the normal cells was formed. An E. coli stock containing thislibrary DNA was spreaded on an LB agar medium containing ampicillin andtetracycline and incubated overnight at 37° C. The colonies thus formedwere transferred onto a nylon membrane (Hybond™-N) and the colonyhybridization was carried out. As a probe, the DNA fragment of 3.5 kbobtained in the above Example 2-d) and labeled with [α-³² P] dCTP byusing a random primer DNA labeling kit (manufactured by Takara ShuzoCo., Ltd.) was used. As the results of screening of 2×10⁴ colonies,seven clones being hybridizable with the probe were obtained. Plasmidswere purified from E. coli cells of these clones. As the result of thecleavage with restriction enzymes, one of these clones contained a DNAfragment of 3.5 kb. This DNA fragment had the restriction enzyme map ofFIG. 2 and thus judged as containing the scaur1^(S) gene. The plasmidcontaining this DNA fragment was named pSCAR1, while E. coli HB101having this plasmid introduced therein was named and designated asEscherichia coli HB101/pSCAR1. This strain has been deposited atNational Institute of Bioscience and Human-Technology, Agency ofIndustrial Science and Technology in accordance with the Budapest Treatyunder the accession number FERM BP-4483. The DNA fragment of 3.5 kbobtained by partially digesting pSCAR1 with HindIII was subcloned intopUC118 and the nucleotide sequence thereof was determined (SEQ ID No. 7in Sequence Listing). A comparison with the resistant gene indicatesthat the base at the position 852 has been changed from T into A and,due to this replacement, the amino acid has been converted fromphenylalanine into tyrosine (SEQ ID No. 8 in Sequence Listing).

2-f) Preparation of genomic library of aureobasidin resistant strainhaving aureobasidin resistant gene scaur2^(R)

A genomic library was prepared from an aureobasidin resistant strainL22-8B by the same method as the one described in Example 2-c). E. colicontaining this genomic library was cultured in an LB medium (50 ml)containing ampicillin and tetracycline at 37° C. overnight. Thenplasmids were recovered and purified from the E. coli cells.

2-g) Expression and cloning of aureobasidin resistant gene scaur2^(R)

The above-mentioned plasmids originating in the genomic library of theL22-8B strain were transformed into S. cerevisiae SH3328 by theabove-mentioned method of R. H. Schiestl. From the transformed strains,an aureobasidin resistant strain was isolated. Then a plasmid DNAcontaining the scaur2^(R) gene was recovered from this transformant bythe above-mentioned method of I. Hagan et al. This plasmid wastransformed into a yeast again and it was confirmed that thetransformant had acquired aureobasidin resistance. This plasmid, whichcontained a DNA of 8.5 kb, was named pSCAR2. FIG. 3 shows therestriction enzyme map of the DNA fragment of 8.5 kb containing thisaureobasidin resistant gene scaur2^(R). E. coli HB101 having thisplasmid pSCAR2 introduced therein was named and designated asEscherichia coli HB101/pSCAR2. This strain has been deposited atNational Institute of Bioscience and Human-Technology, Agency ofIndustrial Science and Technology in accordance with the Budapest Treatyunder the accession number FERM BP-4484. By using BamHI, EcoRI, HindIIIand PstI, DNA fragments of various sizes were prepared and cloned intothe pWH5 vector. These plasmids were transformed into S. cerevisiaeDKD5D in accordance with the above-mentioned method of R. H. Schiestl etal. Then it was examined whether these transformants had acquiredaureobasidin resistance or not. As a result, none of the transformantsof the DNA fragments was a resistant one. Thus it has been clarifiedthat the DNA fragment of the full length is necessary for the expressionof the aureobasidin resistance.

2-h) Isolation of aureobasidin sensitive gene scaur2^(S) correspondingto aureobasidin resistant gene scaur2^(R)

An E. coli stock containing the genomic library of Example 2-e) preparedfrom normal cells of S. cerevisiae DKD5D was spreaded on an LB agarmedium containing ampicillin and tetracycline and incubated at 37° C.overnight. The colonies thus formed were transferred onto a nylonmembrane (Hybond™-N) and the colony hybridization was performed. As aprobe the DNA fragment of 8.5 kb obtained in the above Example 2-g) andlabeled with [α-³² P] dCTP by using a random primer DNA labeling kit wasused. As the results of screening of 2×10⁴ colonies, several clonesbeing hybridizable with the probe were obtained. Some of these clonescontained a DNA fragment of 4.6 kb while others contained a DNA fragmentof 3.9 kb. From the restriction enzyme maps of these DNA fragments, itwas found out that these DNA fragments were each a part of thescaur2^(S) gene shown in FIG. 3. These DNA fragments were ligatedtogether to thereby give a scaur2^(S) fragments shown in FIG. 3. The DNAfragment of 8.5 kb thus obtained was subcloned into pUC118 and then theDNA nucleotide sequence was determined (SEQ ID No. 9 in SequenceListing). Based on the nucleotide sequence of SEQ ID No. 9 in SequenceListing, the amino acid sequence represented by SEQ ID No. 10 inSequence Listing was deduced.

Example 3

Gene disruption test on spaur1^(S) and scaur1^(S) genes

3-a) Gene disruption test on spaur1^(S) gene

In order to examine whether the aureobasidin sensitive gene spaur1^(S)is necessary in the cell growth by the gene disruption test, the plasmidpUARS2R prepared in Example 1-d) was first cleaved with BalI andEcoT22I. After eliminating a DNA fragment of 240 bp, the residual DNAfragment was blunted by using a DNA blunting kit (manufactured by TakaraShuzo Co., Ltd.). Then this DNA was ligated with a DNA containing ura4⁺gene of 1.7 kb, which had been obtained by excising from a pUC8ura4plasmid [Mol. Gen. Genet., 215, 81-86 (1988)] by cleaving with HindIIIand blunting, to thereby give a plasmid pUARS2RBT22::ura4-1 and anotherplasmid pUARS2RBT22::ura4-6 in which the ura4 DNA had been inserted inthe opposite direction. Both of these disrupted genes were excised fromthe vector pUC118 by cleaving with SacI and HindIII andARS2RBT22::ura4-1 and ARS2RBT22::ura4-6 (FIG. 4), which were spaur1^(S)DNA fragments containing ura4⁺, were purified. The purified DNAfragments were transformed into diploid cells Schizo. pombe C525 (h⁹⁰/h⁹⁰, ura4-D18/ura4-D18, leu1/leu1, ade6-M210/ade6-M216) by theabove-mentioned method of Okazaki et al. and then a transformant wasscreened on an SD agar plate containing leucine. In the transformantthus obtained, one of a pair of spaur1^(S) genes on the chromosome hadbeen replaced by the disrupted gene ARS2RBT22::ura4-1 orARS2RBT22::ura4-6 introduced thereinto. These cells were allowed toundergo sporulation on a sporulation medium MEA and subjected to thetetrad analysis. As a result, it was found out that two of the fourascospores formed colonies but the residual two spores formed no colony.That is to say, the spores suffering from the replacement of the normalspaur1^(S) gene by the disrupted gene ARS2RBT22::ura4-1 were notpropagated. It has been thus revealed that the spaur1^(S) gene isessentially required for the growth of the cells.

3-b) Gene disruption test on scaur1^(S) gene

The plasmid pSCAR1 prepared in Example 2-e) was partially digested withHindIII to thereby give a DNA fragment of 3.5 kb shown in FIG. 2. ThisDNA fragment was cloned into the HindIII site of pUC119 and the obtainedproduct was named pSCAR3. The obtained. pSCAR3 was cleaved with StuI andEcoT22I. After eliminating a DNA fragment of 0.3 kb, the obtained DNAwas ligated with a DNA fragment (1.1 kb) of URA3 gene which had beenobtained by cleaving a plasmid pYEUra3 (manufactured by ClontechLaboratories, Inc.) with HindIII and EcoRI and blunting. Thus a plasmidpUSCAR3.ST22::URA3⁺ and another plasmid pUSCAR3.ST22::URA3A, in whichthe URA3 gene had been inserted in the opposite direction, wereobtained. These disrupted gene were excised in the EcoRI site in thescaur1^(S) gene and the EcoRI site in the pUC119 vector by cleaving withEcoRI. The scaur1^(S) DNA fragments containing URA3, SCAR3.ST22::URA3⁺and SCAR3.ST22::URA3A (FIG. 5), were purified. The purified DNAfragments were transformed into diploid cells of S. cerevisiae AOD1(mating type a/α, genotype ura3-52/ura3-52, leu2-3 112/leu2-3 112,trp1/TRP1, thr4/THR4, his4/HIS4) by the above-mentioned method of R. H.Schiestl and transformants were screened on an SD agar plate containingleucine. The transformants thus obtained were allowed to undergosporulation on a sporulation medium SP (1% of potassium acetate, 2% ofagar) and subjected to the tetrad analysis. As a result, it was foundout that two of the four ascospores underwent germination and formedcolonies but the residual two spores did not undergo colony formation.That is to say, the spores suffering from the replacement of thescaur1^(S) gene by the disrupted gene were not propagated. It has beenthus revealed that the scaur1^(S) gene is essentially required for thegrowth of the cells.

Example 4

Examination on the expression of aureobasidin sensitive gene spaur1 bynorthern hybridization

From a normal strain or a resistant strain of Schizo. pombe, the wholeRNAs were extracted and purified by the method of R. Jensen et al.[Proc. Natl. Acad. Sci. USA, 80, 3035-3039 (1983)]. Further, poly(A)⁺RNA was purified by using Oligotex™-dT30 (manufactured by Takara ShuzoCo., Ltd.). The purified poly(A)+RNA (2.5 μg) was separated by theelectrophoresis on a 1.2% agarose gel containing formaldehyde andtransferred onto a nylon membrane (Hybond™-N). After immobilizing, thehybridization was performed with the use of a HindIII-SacI fragment (2kb) of the spaur1^(R) gene labeled with [α-³² P]dCTP as a probe. As aresult, both of the normal cells and the resistant cells showed a bandof the same amount of about 2 kb. In both cases, this amount underwentno change in the logarithmic growth phase and the stationary phase (FIG.10). FIG. 10 is an autoradiogram showing the results of the northernhybridization wherein mRNAs obtained from cells of a sensitive strain ofSchizo. pombe in the logarithmic growth phase (lane 1), cells of aresistant strain in the logarithmic growth phase (lane 2), cells of thesensitive strain in the stationary phase (lane 3) and cells of theresistant strain in the stationary phase (lane 4) are electrophoresed ona 1.2% agarose gel containing formaldehyde.

Example 5

Determination of the activity of scaur1^(S) gene

5-a) Construction of plasmid YEpSCARW3 (FIG. 9) and YEpSCARW1

The plasmid pSCAR1 prepared in Example 2-e) was cleaved with HindIII anda fragment of 2 kb containing the whole ORF was excised. This fragmentwas inserted into the HindIII site of a expression-plasmid YEp52 havinga promoter Gal10, the expression of which was induced by galactose in amedium. The plasmid having the scaur1^(S) gene which had been insertedin such a direction as to be normally transcribed by the promoter Gal10was named YEpSCARW3. FIG. 9 shows the structure of this plasmid.Further, the plasmid having the scaur1^(S) gene inserted in the oppositedirection was named YEpSCARW1.

5-b) Transformation by plasmids YEpSCARW3 and YEpSCARW1

By using 5 μg portions of the plasmids YEpSCARW3 and YEpSCARW1, thediploid S. cerevisiae cells with the disrupted scaur1^(S) gene preparedin Example 3-b) were transformed. Then transformants were screened on anSD agar plate. These transformants were allowed to undergo sporulationon an SP medium and then subjected to the tetrad analysis. When theexpression of the scaur1^(S) gene was induced by using a YPGal medium(1% of yeast extract, 2% of polypeptone, 2% of galactose), theascospores formed from the diploid cells transformed by YEpSCARW3 allunderwent germination while two of the four ascospores formed from thediploid cells transformed by YEpSCARW1 underwent germination but not theremaining two. It is thus conceivable that the cells with the disruptedscaur1^(S) gene have reverted to the normal state by introducingYEpSCARW3 containing the scaur1^(S) gene into these cells. Accordingly,the use of these cells with the disrupted scaur1^(S) gene as a hostmakes it possible to determine the activity of normal aur1-analogousgenes carried by other organisms.

Example 6

Confirmation and cloning of aur1 and aur2 genes (caaur1, caaur2) carriedby C. albicans

6-a) Detection of aur1 gene by the PCR method

Poly(A)⁺ RNA was extracted and purified from an aureobasidin sensitivestrain C. albicans TIMM0136 by the same method as the one employed inExample 4. By using the poly(A)⁺ RNA (5 μg) as a template, adouble-stranded cDNA was synthesized on a cDNA synthesizing system Plus(manufactured by Amersham) with the use of an oligo(dT) primer. Mixedprimers for PCR corresponding to amino acid sequence regions beingcommon to the amino acid sequences of S. cerevisiae and Schizo. pombewere synthesized on a DNA synthesizer and purified. That is to say, aprimer of SEQ ID No. 11 in Sequence Listing corresponding to the regionof amino acids at the 184- to 192-positions of SEQ ID No. 4 in SequenceListing of Schizo. pombe (from the 184- to 192-positions of SEQ ID No. 8in Sequence Listing of S. cerevisiae) and another primer of SEQ ID No.12 in Sequence Listing corresponding to the region of amino acids fromthe 289- to 298-positions of Schizo. pombe (from the 289- to298-positions of SEQ ID No. 8 in Sequence Listing of S. cerevisiae) wereemployed.

PCR was performed by using these primers and the above-mentioned cDNA asa template by repeating a cycle comprising treatment at 94° C. for 30seconds, one at 48° C. for 1 minute and one at 72° C. for 2 minutes 25times. As a result, a DNA (about 350 bp) being almost the same as S.cerevisiae and Schizo. pombe in length was amplified (FIG. 6). FIG. 6shows a pattern obtained by carrying out PCR with the use of cDNA of C.albicans (lane 1), cDNA of S. cerevisiae (lane 2) and cDNA of Schizo.pombe (lane 3) as a template, electrophoresing each PCR product on anagarose gel and staining with ethidium bromide.

6-b) Cloning of aur1 gene (caaur1) of C. albicans

(i) Genomic DNA was extracted and purified from a strain C. albicansTIMM0136 by the same method as the one described in Example 1-c). Afterpartially digesting with HindIII, the DNA fragment was ligated with aTraplexll9 vector which had been completely digested with HindIII andtransformed into E. coli HB101. Thus a genomic library of C. albicanswas prepared. From this library, a DNA fragment of 4.5 kb containing theaur1 gene of C. albicans was cloned by using the DNA fragment of C.albicans obtained by the PCR described in Example 6-a), which had beenlabeled with [α-³² P]dCTP by using a random primer DNA labeling kit(manufactured by Takara Shuzo Co., Ltd.), as a probe. This DNA fragmenthad a restriction enzyme map shown in FIG. 7 and the DNA nucleotidesequence thereof is represented by SEQ ID No. 13 in Sequence Listing.Based on this nucleotide sequence, it was estimated that the caaur1 genecoded for a protein having the amino acid sequence represented by SEQ IDNo. 14 in Sequence Listing. When compared with the scaur1^(S) protein, ahomology of as high as 53% was observed. A Traplexll9 vector having thiscaaur1 gene integrated therein was named pCAAR1, while E. coli HB101transformed by this plasmid was named and designated as Escherichia coliHB101/pCAAR1. This strain has been deposited at National Institute ofBioscience and Human-Technology, Agency of Industrial Science andTechnology in accordance with the Budapest Treaty under the accessionnumber FERM BP-4482.

Next, pCAAR1 was treated with HindIII to thereby give caaur1 of 4.5 kb.Further, it was integrated into pTV118 which had been completelydigested with HindIII to thereby prepare a plasmid for expressingcaaur1. This plasmid was named pTCAAR1.

(ii) Genomic DNA was extracted and purified from a strain C. albicansTIMM1768 [The Journal of Antibiotics, 46, 1414-1420(1993)] by the samemethod as the one described in Example 1-c). After partially digestingwith Hind III, the DNA fragment was ligated with a pUC118 vector whichhad been completely digested with Hind III and transformed into E. coliHB101. Thus a genomic library of C. albicans TIMM1768 was prepared. Fromthis library, a DNA fragment of 4.5 kb containing the aur1 gene of C.albicans TIMM1768 was cloned by the colony hybridization with the sameprobe as that described in Example 6-b)-(i). This DNA fragment had thesame restriction enzyme map as that shown in FIG. 7. Next, a part of theDNA sequence containing a ORF in this DNA fragment was determined. TheDNA nucleotide sequence thereof is represented by SEQ ID No. 21 inSequence Listing. Based on this nucleotide sequence, it was estimatedthat this gene coded for a protein having the amino acid sequencerepresented by SEQ ID No. 22 in Sequence Listing. When the amino acidsequence of the caaur1 protein C. albicans TIMM1768 was compared withthat of the caaur1 protein of C. albicans TIMM0136, the amino acidsequences of the 1- to 381-positions and the 383- to 423-positions andthe 425- to 471-positions of caaur1 protein (SEQ ID No. 14 in SequenceListing) in C. albicans TIMM0136 were identical with the amino acidsequences of the 2- to 382-positions and the 384- to 424-positions andthe 426- to 472-positions, respectively, of caaur1 protein (SEQ ID No.22 in Sequence Listing) in C. albicans TIMM1768.

However, serines at the 382- and 424-positions of SEQ ID No. 14 inSequence Listing were replaced with prolines at the 383- and425-positions of SEQ ID No. 22 in Sequence Listing.

6-c) Cloning of aur2 gene (caaur2) of C. albicans

Genomic DNA of a strain C. albicans TIMM0136 was digested with BamHI andligated with a pTV118 vector which had been completely digested withBamHI. Then it was transformed into E. coli HB101 to thereby prepare agenomic library of C. albicans. On the other hand, the DNA fragmentcontaining the scaur2^(S) gene obtained in Example 2-h) was cleaved withHindIII and PstI to thereby give a DNA fragment of 1.2 kb. This DNAfragment was labeled with [α-³² P]dCTP by using a random primer DNAlabeling kit. By using this labeled DNA fragment as a probe, theabove-mentioned C. albicans genomic library was screened by the colonyhybridization. Thus a plasmid containing a DNA fragment of 8.3 kb wasobtained. A part of the DNA sequence upstream of the BamHI site of thisDNA fragment was determined (SEQ ID No. 15 in Sequence Listing). Basedon this sequence, an amino acid sequence represented by SEQ ID No. 16 inSequence Listing was estimated. It corresponded to the amino acidsequence of the 1230- to 1309-positions of the amino acid sequence ofthe scaur2 gene (SEQ ID No. 10), having a homology of as high as 77%.Since this DNA fragment lacked a part of the C-end, the genomic libraryprepared in Example 6-b) was further screened by using this DNA fragmentas a probe. Thus a DNA fragment of 6.5 kb having the C-terminal part wasobtained. FIG. 8 shows the restriction enzyme map of the DNA regioncontaining the caaur2 gene thus clarified.

A pTV118 vector having the above-mentioned caaur2 gene of 8.3 kbintegrated therein was named pCAAR2N, while E. coli HB101 transformed bythis plasmid was named and designated as Escherichia coli HB101/pCAAR2N.This strain has been deposited at National Institute of Bioscience andHuman-Technology, Agency of Industrial Science and Technology inaccordance with the Budapest Treaty under the accession number FERMBP-4481.

Example 7

Preparation of antibody against protein coded for by scaur1^(S) gene andstaining of S. cerevisiae cells and detection of said protein by usingthis antibody

7-a) Preparation of antibody

SCAR1-1 (SEQ ID No. 19 in Sequence Listing) comprising a peptidecorresponding to the amino acids at the residue 103 to 113 in the aminoacid sequence of SEQ ID No. 8 in Sequence Listing having cysteine addedto the N-end thereof and SCAR1-2 (SEQ ID No. 20 in Sequence Listing)comprising a peptide corresponding to the amino acids at the residue 331to 348 in the amino acid sequence of SEQ ID No. 8 having cysteine addedto the N-end thereof were synthesized by the Fmoc solid phase synthesismethod and purified by reverse phase HPLC. Thus 10 mg portions of thesepeptides were obtained. To the N-terminal cysteine of each of thesesynthetic peptides, KLH was bound as a carrier protein. By using thisbinding product as an antigen, a rabbit was immunized and an antiserumwas obtained. This antiserum was further purified on an affinity columnprepared by binding the synthetic peptide employed as the antigen to anagarose gel. This a polyclonal antibody being specific for the syntheticpeptide was prepared.

7-b) Staining of S. cerevisiae cells with antibody

A strain S. cervisiae ATCC 9763 was cultured in a YNBG medium [0.67% ofyeast nitrogen base (manufactured by Difco), 2% of glucose] to therebygive a suspension of a concentration of 3×10⁷ cells/ml. To 1 ml of thiscell suspension were added 0.11 ml of a 1 M phosphate buffer (pH 6.5)and 0.17 ml of 37% formaldehyde. After slowly stirring at roomtemperature for 1 hour, the cells were harvested by centrifugation andthen suspended in 20 ml of an SS buffer (1 M of sorbitol, 0.2 % ofβ-mercaptoethanol, 0.1 M phosphate buffer, pH 7.5) containing 20 μg/mlof Zymolyase 20T. After treating at 30° C. for 1 hour, the cells wereharvested, washed with the SS buffer, suspended in 1 ml of the SS buffercontaining 0.1% of Triton X-100 and then allowed to stand for 10minutes. This cell suspension was placed on a slide glass which had beencoated with poly(L-lysine) and allowed to stand for 10 minutes. Next, aPBS solution containing 1% of albumin (BSA) was dropped thereinto. Afterallowing to stand at room temperature for 15 minutes, the excessiveliquid was removed and then a PBS solution containing BSA containing0.02 mg/ml of the antiSCAR1-1 antibody was dropped thereinto. Afterallowing to stand at room temperature for 60 minutes and washing withPBS containing BSA three times, antirabbit IgG antibody labeled withFITC (antibody concentration 0.02 mg/ml) was layered over and allowed tostand at room temperature for 1 hour. After washing with a PBS solutioncontaining BSA, a small amount of a mountain solution, which was asolution prepared by dissolving 0.1 g of p-phenylenediamine in 10 ml ofCBS (150 mM of NaCl, 50 mM of CHES, pH 9.5), adjusting the pH value to9.0 with 10 N NaOH and further adding 90 ml of glycerol, was layeredover. Then a cover glass was placed thereon to thereby give a specimen.This specimen was observed under a fluorescence microscope to therebyexamine the intracellular distribution of the scaur1 protein. As aresult, it was found out that this protein was distributed all over thecells.

7-c) Detection of protein coded for by scaur1 gene by using antibody

The plasmid YEpSCARW3 prepared in Example 5-a) was introduced into anormal haploid S. cerevisiae SH3328 to thereby give a transformant. Thistransformant was cultured in a YPGal medium or a YPD medium and thecells were harvested by centrifugation. The cells thus obtained weresuspended in a buffer (1% of Triton X-100, 1% of SDS, 20 mM of Tris-HCl,pH 7.9, 10 mM of EDTA, 1 mM of DTT, 1 mM of PMSF). Further, glass beadswere added thereto to disrupt the cells by vigorous vortex. Then an SDSloading solution was added thereto and the protein was denatured bytreating at 95° C. for 5 minutes. After centrifuging, a part of theobtained supernatant was subjected to SDS-PAGE and the protein thusseparated was transfered onto an Immobilon membrane (manufactured byMILLIPORE). This Immobilon membrane was treated with Block Ace(manufactured by Dainippon Pharmaceutical Co., Ltd.). Then theantiSCAR1-2 antibody prepared in 7-a) was reacted therewith as a primaryantibody. After washing, antirabbit IgG antibody labeled with peroxidasewas reacted therewith as a secondary antibody and the mixture wasthoroughly washed. Next, it was color-developed with diaminobenzidineand a band of the scaur1 protein was detected. FIG. 11 shows theresults.

FIG. 11 shows the results of the detection of the protein prepared fromthe cells incubated in the YPD medium (lane 1) and the protein preparedfrom the cells incubated in the YPGal medium (lane 2), each subjected toSDS-PAGE, by using the antiSCAR1-2 antibody. The cells incubated in theYPGal medium, of which scaur1 gene had been induced, showed a specificband.

According to the present invention, a novel protein regulatingaureobasidin sensitivity and a gene coding for the protein, i.e., a generegulating aureobasidin sensitivity are provided. These substances areuseful in the diagnosis and treatment for diseases caused by organismshaving the above-mentioned gene, such as mycoses. The present inventionfurther provides an antisense DNA and an antisense RNA of this gene, anucleic acid probe being hybridizable with the gene, a process fordetecting the gene by using this nucleic acid probe, a process forproducing a protein regulating aureobasidin sensitivity by using atransformant having the gene introduced thereinto, an antibody for theprotein and a process for detecting the protein by using this antibody.They are also useful in the diagnosis and treatment of diseasesincluding mycoses.

    __________________________________________________________________________    #             SEQUENCE LISTING                                                   - -  - - (1) GENERAL INFORMATION:                                             - -    (iii) NUMBER OF SEQUENCES:  22                                         - -  - - (2) INFORMATION FOR SEQ ID NO:1:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  2385                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: genomic DNA                                       - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                               - - AAGCTTTTTT GCCTCTGCAA AAGTTCCTTT CTCGAATTGG TTTTTTGAGG AA -            #AAGCAAGT     60                                                                 - - TAATAAACTA ATTATATTAT ATATAATTAG CAATTTTATA AAAAAAATAA AA -            #AAATAGCC    120                                                                 - - CTGATTGCTG GCAACTGTGA GCTGAACATT GGTTAATCGG TCCATCTTTT TT -            #TAAATATT    180                                                                 - - TTACATCGCT ACTTTTAAGT GCTTGACACT TGCATTTAAT AGCTACTTTC TT -            #TCCTTCAT    240                                                                 - - AAAAATTCCT TTTTTTTCCT TTAGTTTTCC GGTTAATTCC TTACGAAATT TT -            #TTTCGTAC    300                                                                 - - GCTTCCCTTT TTTACTCTGA TAATTCTTTG AAGCAATGTC TGCTCTTTCG AC -            #CTTAAAAA    360                                                                 - - AGCGCCTTGC TGCGTGTAAC CGAGCATCCC AATACAAGTT GGAAACAAGC TT -            #AAACCCTA    420                                                                 - - TGCCTACATT TCGTTTGCTA CGCAATACGA AATGGTCATG GACACATTTG CA -            #ATATGTGT    480                                                                 - - TTCTAGCAGG TAATTTGATT TTTGCTTGTA TTGTCATTGA ATCTCCTGGA TT -            #CTGGGGGA    540                                                                 - - AATTTGGCAT TGCCTGTCTT TTGGCCATTG CGTTGACCGT TCCTTTAACA CG -            #CCAAATTT    600                                                                 - - TTTTTCCTGC CATTGTTATC ATCACCTGGG CAATTTTATT TTACTCTTGT AG -            #GTTTATTC    660                                                                 - - CAGAACGCTG GCGTCCACCC ATATGGGTTC GTGTTTTACC CACACTTGAA AA -            #TATTCTTT    720                                                                 - - ATGGCTCTAA TCTTTCTAGT CTTCTCTCGA AAACCACGCA TAGCATCCTT GA -            #TATTTTGG    780                                                                 - - CCTGGGTTCC ATATGGAGTC ATGCATTATT CGGCTCCTTT TATCATTTCA TT -            #TATTCTTT    840                                                                 - - TCATCTTTGC ACCTCCTGGA ACTCTTCCAG TTTGGGCTCG AACTTTTGGT TA -            #TATGAATT    900                                                                 - - TATTTGGTGT TCTTATCCAA ATGGCTTTCC CCTGTTCTCC TCCTTGGTAT GA -            #AAATATGT    960                                                                 - - ATGGTTTAGA ACCTGCCACG TATGCAGTAC GTGGCTCTCC TGGTGGATTG GC -            #CCGTATTG   1020                                                                 - - ATGCTCTCTT CGGCACTAGC ATTTACACTG ATTGTTTTTC TAACTCTCCG GT -            #TGTTTTTG   1080                                                                 - - GTGCCTTTCC ATCTCTTCAC GCTGGATGGG CCATGCTGGA AGCACTTTTC CT -            #TTCGCATG   1140                                                                 - - TGTTTCCTCG ATACCGCTTC TGCTTTTATG GATATGTTCT ATGGCTTTGC TG -            #GTGTACTA   1200                                                                 - - TGTACCTTAC CCACCACTAC TTTGTAGATT TGGTCGGCGG TATGTGTTTA GC -            #TATTATAT   1260                                                                 - - GCTTCGTTTT TGCTCAAAAG CTACGCCTCC CACAGTTGCA AACTGGTAAA AT -            #CCTTCGTT   1320                                                                 - - GGGAATACGA GTTTGTTATC CACGGTCATG GTCTTTCCGA AAAAACCAGC AA -            #CTCCTTGG   1380                                                                 - - CTCGTACCGG CAGCCCATAC TTACTTGGAA GGGATTCTTT TACTCAAAAC CC -            #TAATGCAG   1440                                                                 - - TAGCCTTCAT GAGTGGTCTT AACAATATGG AACTTGCTAA CACCGATCAT GA -            #ATGGTCCG   1500                                                                 - - TGGGTTCATC ATCACCTGAG CCGTTACCTA GTCCTGCTGC TGATTTGATT GA -            #TCGTCCTG   1560                                                                 - - CCAGTACCAC TTCCTCCATC TTTGATGCAA GTCATCTTCC TTAAATCAAC GT -            #GCTTTAAG   1620                                                                 - - AATATATTTC CAAAAGCTAC ATGATACATT GACTAGAATC GGTTTGATTC AT -            #AGTGGTAT   1680                                                                 - - TGGAATGATG TTGTTCATTG TGTTTTTTAA CTGTTAATCT GACATCCATT GA -            #GTCATTCT   1740                                                                 - - TTACAATTTG TAAAATTAAT TTGTATCACT AATTTTGAAG GAAGCTATTT TG -            #GTATTAAT   1800                                                                 - - ACCGCTTTTG GTCTCCACTT CCTTTTCGAA ACTCTTAACA GCGATTAGGC CG -            #GGTATCTT   1860                                                                 - - CCAGTGTGAT GTATAGGTAT TTGTCGTTTT TTTATCATTT CCGTTAATAA AG -            #AACTCTTT   1920                                                                 - - TATCCAGCTT CTTACACTGT CAACTGTTGT GAAAGGAACA CATTTAGAAT TT -            #CATTTTCC   1980                                                                 - - TTATTTGTTG TGATTTAAAT CGTTTGACAT AATTTTAAAT TTGGTTTGAA AT -            #GTGTGTGA   2040                                                                 - - GAAGGCTTGT TTTATTCATT TAGTTTATTG CTTGTTTGCA CGAAAATCCA GA -            #ACGGAGCA   2100                                                                 - - TTAATGTAAT CCTTTTTTAT TCTGTAAAGC GTTTTTATAC AAATGTTGGT TA -            #TACGTTTC   2160                                                                 - - TAAAATAAGA ATATTGTTAT AATAATATAG TTTTTTCTAT CATTTGTTAC AC -            #ACACTAAA   2220                                                                 - - GAGACATTAA GGATAAGCAA ATGTGTTAAA ATGATAATAT ATTTTGGAAA CA -            #TTTATAAA   2280                                                                 - - GAAATTAAGC AGCTTTGACT AACTACATTT TTGTTTTTTT CCTAAGCAAA AC -            #TGTATAGT   2340                                                                 - - TATACACGCG AGCTGTATTC ACTTCCATTG TAGTGACTTG AGCTC   - #                    2385                                                                        - -  - - (2) INFORMATION FOR SEQ ID NO:2:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  422                                                              (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                               - - Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Le - #u Ala Ala Cys Asn            1               5 - #                 10 - #                 15              - - Arg Ala Ser Gln Tyr Lys Leu Glu Thr Ser Le - #u Asn Pro Met Pro                           20 - #                 25 - #                 30              - - Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Se - #r Trp Thr His Leu                           35 - #                 40 - #                 45              - - Gln Tyr Val Phe Leu Ala Gly Asn Leu Ile Ph - #e Ala Cys Ile Val                           50 - #                 55 - #                 60              - - Ile Glu Ser Pro Gly Phe Trp Gly Lys Phe Gl - #y Ile Ala Cys Leu                           65 - #                 70 - #                 75              - - Leu Ala Ile Ala Leu Thr Val Pro Leu Thr Ar - #g Gln Ile Phe Phe                           80 - #                 85 - #                 90              - - Pro Ala Ile Val Ile Ile Thr Trp Ala Ile Le - #u Phe Tyr Ser Cys                           95 - #                100 - #                105              - - Arg Phe Ile Pro Glu Arg Trp Arg Pro Pro Il - #e Trp Val Arg Val                          110  - #               115  - #               120              - - Leu Pro Thr Leu Glu Asn Ile Leu Tyr Gly Se - #r Asn Leu Ser Ser                          125  - #               130  - #               135              - - Leu Leu Ser Lys Thr Thr His Ser Ile Leu As - #p Ile Leu Ala Trp                          140  - #               145  - #               150              - - Val Pro Tyr Gly Val Met His Tyr Ser Ala Pr - #o Phe Ile Ile Ser                          155  - #               160  - #               165              - - Phe Ile Leu Phe Ile Phe Ala Pro Pro Gly Th - #r Leu Pro Val Trp                          170  - #               175  - #               180              - - Ala Arg Thr Phe Gly Tyr Met Asn Leu Phe Gl - #y Val Leu Ile Gln                          185  - #               190  - #               195              - - Met Ala Phe Pro Cys Ser Pro Pro Trp Tyr Gl - #u Asn Met Tyr Gly                          200  - #               205  - #               210              - - Leu Glu Pro Ala Thr Tyr Ala Val Arg Gly Se - #r Pro Gly Gly Leu                          215  - #               220  - #               225              - - Ala Arg Ile Asp Ala Leu Phe Gly Thr Ser Il - #e Tyr Thr Asp Cys                          230  - #               235  - #               240              - - Phe Ser Asn Ser Pro Val Val Phe Gly Ala Ph - #e Pro Ser Leu His                          245  - #               250  - #               255              - - Ala Gly Trp Ala Met Leu Glu Ala Leu Phe Le - #u Ser His Val Phe                          260  - #               265  - #               270              - - Pro Arg Tyr Arg Phe Cys Phe Tyr Gly Tyr Va - #l Leu Trp Leu Cys                          275  - #               280  - #               285              - - Trp Cys Thr Met Tyr Leu Thr His His Tyr Ph - #e Val Asp Leu Val                          290  - #               295  - #               300              - - Gly Gly Met Cys Leu Ala Ile Ile Cys Phe Va - #l Phe Ala Gln Lys                          305  - #               310  - #               315              - - Leu Arg Leu Pro Gln Leu Gln Thr Gly Lys Il - #e Leu Arg Trp Glu                          320  - #               325  - #               330              - - Tyr Glu Phe Val Ile His Gly His Gly Leu Se - #r Glu Lys Thr Ser                          335  - #               340  - #               345              - - Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Le - #u Leu Gly Arg Asp                          350  - #               355  - #               360              - - Ser Phe Thr Gln Asn Pro Asn Ala Val Ala Ph - #e Met Ser Gly Leu                          365  - #               370  - #               375              - - Asn Asn Met Glu Leu Ala Asn Thr Asp His Gl - #u Trp Ser Val Gly                          380  - #               385  - #               390              - - Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Al - #a Ala Asp Leu Ile                          395  - #               400  - #               405              - - Asp Arg Pro Ala Ser Thr Thr Ser Ser Ile Ph - #e Asp Ala Ser His                          410  - #               415  - #               420              - - Leu Pro                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:3:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  2385                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: genomic DNA                                       - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                               - - AAGCTTTTTT GCCTCTGCAA AAGTTCCTTT CTCGAATTGG TTTTTTGAGG AA -             #AAGCAAGT     60                                                                 - - TAATAAACTA ATTATATTAT ATATAATTAG CAATTTTATA AAAAAAATAA AA -            #AAATAGCC    120                                                                 - - CTGATTGCTG GCAACTGTGA GCTGAACATT GGTTAATCGG TCCATCTTTT TT -            #TAAATATT    180                                                                 - - TTACATCGCT ACTTTTAAGT GCTTGACACT TGCATTTAAT AGCTACTTTC TT -            #TCCTTCAT    240                                                                 - - AAAAATTCCT TTTTTTTCCT TTAGTTTTCC GGTTAATTCC TTACGAAATT TT -            #TTTCGTAC    300                                                                 - - GCTTCCCTTT TTTACTCTGA TAATTCTTTG AAGCAATGTC TGCTCTTTCG AC -            #CTTAAAAA    360                                                                 - - AGCGCCTTGC TGCGTGTAAC CGAGCATCCC AATACAAGTT GGAAACAAGC TT -            #AAACCCTA    420                                                                 - - TGCCTACATT TCGTTTGCTA CGCAATACGA AATGGTCATG GACACATTTG CA -            #ATATGTGT    480                                                                 - - TTCTAGCAGG TAATTTGATT TTTGCTTGTA TTGTCATTGA ATCTCCTGGA TT -            #CTGGGGGA    540                                                                 - - AATTTGGCAT TGCCTGTCTT TTGGCCATTG CGTTGACCGT TCCTTTAACA CG -            #CCAAATTT    600                                                                 - - TTTTTCCTGC CATTGTTATC ATCACCTGGG CAATTTTATT TTACTCTTGT AG -            #GTTTATTC    660                                                                 - - CAGAACGCTG GCGTCCACCC ATATGGGTTC GTGTTTTACC CACACTTGAA AA -            #TATTCTTT    720                                                                 - - ATGGCTCTAA TCTTTCTAGT CTTCTCTCGA AAACCACGCA TAGCATCCTT GA -            #TATTTTGG    780                                                                 - - CCTGGGTTCC ATATGGAGTC ATGCATTATT CGGCTCCTTT TATCATTTCA TT -            #TATTCTTT    840                                                                 - - TCATCTTTGC ACCTCCTGGA ACTCTTCCAG TTTGGGCTCG AACTTTTGGT TA -            #TATGAATT    900                                                                 - - TATTTGGTGT TCTTATCCAA ATGGCTTTCC CCTGTTCTCC TCCTTGGTAT GA -            #AAATATGT    960                                                                 - - ATGGTTTAGA ACCTGCCACG TATGCAGTAC GTGGCTCTCC TGGTGGATTG GC -            #CCGTATTG   1020                                                                 - - ATGCTCTCTT CGGCACTAGC ATTTACACTG ATGGTTTTTC TAACTCTCCG GT -            #TGTTTTTG   1080                                                                 - - GTGCCTTTCC ATCTCTTCAC GCTGGATGGG CCATGCTGGA AGCACTTTTC CT -            #TTCGCATG   1140                                                                 - - TGTTTCCTCG ATACCGCTTC TGCTTTTATG GATATGTTCT ATGGCTTTGC TG -            #GTGTACTA   1200                                                                 - - TGTACCTTAC CCACCACTAC TTTGTAGATT TGGTCGGCGG TATGTGTTTA GC -            #TATTATAT   1260                                                                 - - GCTTCGTTTT TGCTCAAAAG CTACGCCTCC CACAGTTGCA AACTGGTAAA AT -            #CCTTCGTT   1320                                                                 - - GGGAATACGA GTTTGTTATC CACGGTCATG GTCTTTCCGA AAAAACCAGC AA -            #CTCCTTGG   1380                                                                 - - CTCGTACCGG CAGCCCATAC TTACTTGGAA GGGATTCTTT TACTCAAAAC CC -            #TAATGCAG   1440                                                                 - - TAGCCTTCAT GAGTGGTCTT AACAATATGG AACTTGCTAA CACCGATCAT GA -            #ATGGTCCG   1500                                                                 - - TGGGTTCATC ATCACCTGAG CCGTTACCTA GTCCTGCTGC TGATTTGATT GA -            #TCGTCCTG   1560                                                                 - - CCAGTACCAC TTCCTCCATC TTTGATGCAA GTCATCTTCC TTAAATCAAC GT -            #GCTTTAAG   1620                                                                 - - AATATATTTC CAAAAGCTAC ATGATACATT GACTAGAATC GGTTTGATTC AT -            #AGTGGTAT   1680                                                                 - - TGGAATGATG TTGTTCATTG TGTTTTTTAA CTGTTAATCT GACATCCATT GA -            #GTCATTCT   1740                                                                 - - TTACAATTTG TAAAATTAAT TTGTATCACT AATTTTGAAG GAAGCTATTT TG -            #GTATTAAT   1800                                                                 - - ACCGCTTTTG GTCTCCACTT CCTTTTCGAA ACTCTTAACA GCGATTAGGC CG -            #GGTATCTT   1860                                                                 - - CCAGTGTGAT GTATAGGTAT TTGTCGTTTT TTTATCATTT CCGTTAATAA AG -            #AACTCTTT   1920                                                                 - - TATCCAGCTT CTTACACTGT CAACTGTTGT GAAAGGAACA CATTTAGAAT TT -            #CATTTTCC   1980                                                                 - - TTATTTGTTG TGATTTAAAT CGTTTGACAT AATTTTAAAT TTGGTTTGAA AT -            #GTGTGTGA   2040                                                                 - - GAAGGCTTGT TTTATTCATT TAGTTTATTG CTTGTTTGCA CGAAAATCCA GA -            #ACGGAGCA   2100                                                                 - - TTAATGTAAT CCTTTTTTAT TCTGTAAAGC GTTTTTATAC AAATGTTGGT TA -            #TACGTTTC   2160                                                                 - - TAAAATAAGA ATATTGTTAT AATAATATAG TTTTTTCTAT CATTTGTTAC AC -            #ACACTAAA   2220                                                                 - - GAGACATTAA GGATAAGCAA ATGTGTTAAA ATGATAATAT ATTTTGGAAA CA -            #TTTATAAA   2280                                                                 - - GAAATTAAGC AGCTTTGACT AACTACATTT TTGTTTTTTT CCTAAGCAAA AC -            #TGTATAGT   2340                                                                 - - TATACACGCG AGCTGTATTC ACTTCCATTG TAGTGACTTG AGCTC   - #                    2385                                                                        - -  - - (2) INFORMATION FOR SEQ ID NO:4:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  422                                                              (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                               - - Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Le - #u Ala Ala Cys Asn            1               5 - #                 10 - #                 15              - - Arg Ala Ser Gln Tyr Lys Leu Glu Thr Ser Le - #u Asn Pro Met Pro                           20 - #                 25 - #                 30              - - Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Se - #r Trp Thr His Leu                           35 - #                 40 - #                 45              - - Gln Tyr Val Phe Leu Ala Gly Asn Leu Ile Ph - #e Ala Cys Ile Val                           50 - #                 55 - #                 60              - - Ile Glu Ser Pro Gly Phe Trp Gly Lys Phe Gl - #y Ile Ala Cys Leu                           65 - #                 70 - #                 75              - - Leu Ala Ile Ala Leu Thr Val Pro Leu Thr Ar - #g Gln Ile Phe Phe                           80 - #                 85 - #                 90              - - Pro Ala Ile Val Ile Ile Thr Trp Ala Ile Le - #u Phe Tyr Ser Cys                           95 - #                100 - #                105              - - Arg Phe Ile Pro Glu Arg Trp Arg Pro Pro Il - #e Trp Val Arg Val                          110  - #               115  - #               120              - - Leu Pro Thr Leu Glu Asn Ile Leu Tyr Gly Se - #r Asn Leu Ser Ser                          125  - #               130  - #               135              - - Leu Leu Ser Lys Thr Thr His Ser Ile Leu As - #p Ile Leu Ala Trp                          140  - #               145  - #               150              - - Val Pro Tyr Gly Val Met His Tyr Ser Ala Pr - #o Phe Ile Ile Ser                          155  - #               160  - #               165              - - Phe Ile Leu Phe Ile Phe Ala Pro Pro Gly Th - #r Leu Pro Val Trp                          170  - #               175  - #               180              - - Ala Arg Thr Phe Gly Tyr Met Asn Leu Phe Gl - #y Val Leu Ile Gln                          185  - #               190  - #               195              - - Met Ala Phe Pro Cys Ser Pro Pro Trp Tyr Gl - #u Asn Met Tyr Gly                          200  - #               205  - #               210              - - Leu Glu Pro Ala Thr Tyr Ala Val Arg Gly Se - #r Pro Gly Gly Leu                          215  - #               220  - #               225              - - Ala Arg Ile Asp Ala Leu Phe Gly Thr Ser Il - #e Tyr Thr Asp Gly                          230  - #               235  - #               240              - - Phe Ser Asn Ser Pro Val Val Phe Gly Ala Ph - #e Pro Ser Leu His                          245  - #               250  - #               255              - - Ala Gly Trp Ala Met Leu Glu Ala Leu Phe Le - #u Ser His Val Phe                          260  - #               265  - #               270              - - Pro Arg Tyr Arg Phe Cys Phe Tyr Gly Tyr Va - #l Leu Trp Leu Cys                          275  - #               280  - #               285              - - Trp Cys Thr Met Tyr Leu Thr His His Tyr Ph - #e Val Asp Leu Val                          290  - #               295  - #               300              - - Gly Gly Met Cys Leu Ala Ile Ile Cys Phe Va - #l Phe Ala Gln Lys                          305  - #               310  - #               315              - - Leu Arg Leu Pro Gln Leu Gln Thr Gly Lys Il - #e Leu Arg Trp Glu                          320  - #               325  - #               330              - - Tyr Glu Phe Val Ile His Gly His Gly Leu Se - #r Glu Lys Thr Ser                          335  - #               340  - #               345              - - Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Le - #u Leu Gly Arg Asp                          350  - #               355  - #               360              - - Ser Phe Thr Gln Asn Pro Asn Ala Val Ala Ph - #e Met Ser Gly Leu                          365  - #               370  - #               375              - - Asn Asn Met Glu Leu Ala Asn Thr Asp His Gl - #u Trp Ser Val Gly                          380  - #               385  - #               390              - - Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Al - #a Ala Asp Leu Ile                          395  - #               400  - #               405              - - Asp Arg Pro Ala Ser Thr Thr Ser Ser Ile Ph - #e Asp Ala Ser His                          410  - #               415  - #               420              - - Leu Pro                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:5:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  2340                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: genomic DNA                                       - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                               - - TTTCTTTCTG TCAAAGAATA ATAAAGTGCC CATCAGTGTT CATATTTGTT AC -             #AAAGTGGT     60                                                                 - - TTTCTGATTT GGTACTACTG CAGAGGCGTA TTTTTTGCTT CAGTTACCAT AG -            #CGTAAGAA    120                                                                 - - CACTAGCGAC TTTTGTTCGT GAACCAACAG AGTAGGATTT CTACTGCTAC AT -            #CTCTTAGG    180                                                                 - - TAGTTGGTTA GTCCGATCGC TCACTTTTGG TTGTTGTTAA GTACTTCATA AG -            #TTTATCCT    240                                                                 - - TTTCCTTTTT CACACTGAGC TACTTTGGGT ATAGCTTTTG GCCCAAGGAT CT -            #TTGAATTT    300                                                                 - - TCTCCAAAAG TACTTTATTT TATATCCTAC AGGTTGCGGT TTTCATATTT TA -            #AAAAGCTT    360                                                                 - - TTTAATCATT CCTTTGCGTA TGGCAAACCC TTTTTCGAGA TGGTTTCTAT CA -            #GAGAGACC    420                                                                 - - TCCAAACTGC CATGTAGCCG ATTTAGAAAC AAGTTTAGAT CCCCATCAAA CG -            #TTGTTGAA    480                                                                 - - GGTGCAAAAA TACAAACCCG CTTTAAGCGA CTGGGTGCAT TACATCTTCT TG -            #GGATCCAT    540                                                                 - - CATGCTGTTT GTGTTCATTA CTAATCCCGC ACCTTGGATC TTCAAGATCC TT -            #TTTTATTG    600                                                                 - - TTTCTTGGGC ACTTTATTCA TCATTCCAGC TACGTCACAG TTTTTCTTCA AT -            #GCCTTGCC    660                                                                 - - CATCCTAACA TGGGTGGCGC TGTATTTCAC TTCATCGTAC TTTCCAGATG AC -            #CGCAGGCC    720                                                                 - - TCCTATTACT GTCAAAGTGT TACCAGCGGT GGAAACAATT TTATACGGCG AC -            #AATTTAAG    780                                                                 - - TGATATTCTT GCAACATCGA CGAATTCCTT TTTGGACATT TTAGCATGGT TA -            #CCGTACGG    840                                                                 - - ACTATTTCAT TATGGGGCCC CATTTGTCGT TGCTGCCATC TTATTCGTAT TT -            #GGTCCACC    900                                                                 - - AACTGTTTTG CAAGGTTATG CTTTTGCATT TGGTTATATG AACCTGTTTG GT -            #GTTATCAT    960                                                                 - - GCAAAATGTC TTTCCAGCCG CTCCCCCATG GTATAAAATT CTCTATGGAT TG -            #CAATCAGC   1020                                                                 - - CAACTATGAT ATGCATGGCT CGCCTGGTGG ATTAGCTAGA ATTGATAAGC TA -            #CTCGGTAT   1080                                                                 - - TAATATGTAT ACTACAGCTT TTTCAAATTC CTCCGTCATT TTCGGTGCTT TT -            #CCTTCACT   1140                                                                 - - GCATTCCGGG TGTGCTACTA TGGAAGCCCT GTTTTTCTGT TATTGTTTTC CA -            #AAATTGAA   1200                                                                 - - GCCCTTGTTT ATTGCTTATG TTTGCTGGTT ATGGTGGTCA ACTATGTATC TG -            #ACACACCA   1260                                                                 - - TTATTTTGTA GACCTTATGG CAGGTTCTGT GCTGTCATAC GTTATTTTCC AG -            #TACACAAA   1320                                                                 - - GTACACACAT TTACCAATTG TAGATACATC TCTTTTTTGC AGATGGTCAT AC -            #ACTTCAAT   1380                                                                 - - TGAGAAATAC GATATATCAA AGAGTGATCC ATTGGCTGCA GATTCAAACG AT -            #ATCGAAAG   1440                                                                 - - TGTCCCTTTG TCCAACTTGG AACTTGACTT TGATCTTAAT ATGACTGATG AA -            #CCCAGTGT   1500                                                                 - - AAGCCCTTCG TTATTTGATG GATCTACTTC TGTTTCTCGT TCGTCCGCCA CG -            #TCTATAAC   1560                                                                 - - GTCACTAGGT GTAAAGAGGG CTTAATGAGT ATTTTATCTG CAATTACGGA TA -            #CGGTTGGT   1620                                                                 - - CTTATGTAGA TACATATAAA TATATATCTT TTTCTTTCTT TTTCTTAGTC AG -            #GATTGTCG   1680                                                                 - - TTTAGCATAA TATACATGTA GTTTATTTAA TCACATACCA CTGATTATCT TT -            #AGAATTTT   1740                                                                 - - ATAAATTTTT GAAATAAATG GGTGGCTTTT AATGGTGTCT ATGTTAAGTG AG -            #GCTTTTAG   1800                                                                 - - AATGCTCTTC CTGCTTTGTT TATTATATGT GTATGAAAGA TATGTATGTA TT -            #TACATGTG   1860                                                                 - - TTTGTAGCGT CCCCAGTCAA AACCTGTGCG CTATACCTAA ATGGATTGAT AA -            #TCTTCATT   1920                                                                 - - CACTAATTCT AAAATAGACT TCTTCCCCAA AGAACGGTGT AACGATGAGG CT -            #CTATCCAG   1980                                                                 - - CTGCTTATCT AAATCAACTT TAACGATGGA TGATCTTATG ACACGGGGAT CT -            #TTCTTTAA   2040                                                                 - - AGTTCTTAGA ATTTCAGACT GTACCGCAGC TGATGAATCA AACAGCATTA AA -            #AAGTGATA   2100                                                                 - - TGCTCGAAAA TGTTTTTCCT GGTCTTTCTT CATTATTTTA GGAAGATACC TT -            #ATGCCCAT   2160                                                                 - - GGGTACAATG TCCCTCACCA CACCTCTGTT TTGAATAATC AGTTTCCCGA TT -            #GTGGAAGA   2220                                                                 - - CAATTCTTTT GCTTCCAACT TTGGCGCATT GGAGTTGGTT ATGCGAACAA GT -            #CCGATCAG   2280                                                                 - - CTCATAAAGC ATCTTAGTGA AAAGGGTGGT TTTGCGTTAT TCTTTCCTCT GT -            #TGAAGCTT   2340                                                                 - -  - - (2) INFORMATION FOR SEQ ID NO:6:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  401                                                              (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                               - - Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Se - #r Glu Arg Pro Pro           1               5 - #                 10 - #                 15              - - Asn Cys His Val Ala Asp Leu Glu Thr Ser Le - #u Asp Pro His Gln                           20 - #                 25 - #                 30              - - Thr Leu Leu Lys Val Gln Lys Tyr Lys Pro Al - #a Leu Ser Asp Trp                           35 - #                 40 - #                 45              - - Val His Tyr Ile Phe Leu Gly Ser Ile Met Le - #u Phe Val Phe Ile                           50 - #                 55 - #                 60              - - Thr Asn Pro Ala Pro Trp Ile Phe Lys Ile Le - #u Phe Tyr Cys Phe                           65 - #                 70 - #                 75              - - Leu Gly Thr Leu Phe Ile Ile Pro Ala Thr Se - #r Gln Phe Phe Phe                           80 - #                 85 - #                 90              - - Asn Ala Leu Pro Ile Leu Thr Trp Val Ala Le - #u Tyr Phe Thr Ser                           95 - #                100 - #                105              - - Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro Il - #e Thr Val Lys Val                          110  - #               115  - #               120              - - Leu Pro Ala Val Glu Thr Ile Leu Tyr Gly As - #p Asn Leu Ser Asp                          125  - #               130  - #               135              - - Ile Leu Ala Thr Ser Thr Asn Ser Phe Leu As - #p Ile Leu Ala Trp                          140  - #               145  - #               150              - - Leu Pro Tyr Gly Leu Phe His Tyr Gly Ala Pr - #o Phe Val Val Ala                          155  - #               160  - #               165              - - Ala Ile Leu Phe Val Phe Gly Pro Pro Thr Va - #l Leu Gln Gly Tyr                          170  - #               175  - #               180              - - Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gl - #y Val Ile Met Gln                          185  - #               190  - #               195              - - Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Ly - #s Ile Leu Tyr Gly                          200  - #               205  - #               210              - - Leu Gln Ser Ala Asn Tyr Asp Met His Gly Se - #r Pro Gly Gly Leu                          215  - #               220  - #               225              - - Ala Arg Ile Asp Lys Leu Leu Gly Ile Asn Me - #t Tyr Thr Thr Ala                          230  - #               235  - #               240              - - Phe Ser Asn Ser Ser Val Ile Phe Gly Ala Ph - #e Pro Ser Leu His                          245  - #               250  - #               255              - - Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Ph - #e Cys Tyr Cys Phe                          260  - #               265  - #               270              - - Pro Lys Leu Lys Pro Leu Phe Ile Ala Tyr Va - #l Cys Trp Leu Trp                          275  - #               280  - #               285              - - Trp Ser Thr Met Tyr Leu Thr His His Tyr Ph - #e Val Asp Leu Met                          290  - #               295  - #               300              - - Ala Gly Ser Val Leu Ser Tyr Val Ile Phe Gl - #n Tyr Thr Lys Tyr                          305  - #               310  - #               315              - - Thr His Leu Pro Ile Val Asp Thr Ser Leu Ph - #e Cys Arg Trp Ser                          320  - #               325  - #               330              - - Tyr Thr Ser Ile Glu Lys Tyr Asp Ile Ser Ly - #s Ser Asp Pro Leu                          335  - #               340  - #               345              - - Ala Ala Asp Ser Asn Asp Ile Glu Ser Val Pr - #o Leu Ser Asn Leu                          350  - #               355  - #               360              - - Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Gl - #u Pro Ser Val Ser                          365  - #               370  - #               375              - - Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Se - #r Arg Ser Ser Ala                          380  - #               385  - #               390              - - Thr Ser Ile Thr Ser Leu Gly Val Lys Arg Al - #a                                          395  - #               400                                     - -  - - (2) INFORMATION FOR SEQ ID NO:7:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  2340                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: genomic DNA                                       - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                               - - TTTCTTTCTG TCAAAGAATA ATAAAGTGCC CATCAGTGTT CATATTTGTT AC -             #AAAGTGGT     60                                                                 - - TTTCTGATTT GGTACTACTG CAGAGGCGTA TTTTTTGCTT CAGTTACCAT AG -            #CGTAAGAA    120                                                                 - - CACTAGCGAC TTTTGTTCGT GAACCAACAG AGTAGGATTT CTACTGCTAC AT -            #CTCTTAGG    180                                                                 - - TAGTTGGTTA GTCCGATCGC TCACTTTTGG TTGTTGTTAA GTACTTCATA AG -            #TTTATCCT    240                                                                 - - TTTCCTTTTT CACACTGAGC TACTTTGGGT ATAGCTTTTG GCCCAAGGAT CT -            #TTGAATTT    300                                                                 - - TCTCCAAAAG TACTTTATTT TATATCCTAC AGGTTGCGGT TTTCATATTT TA -            #AAAAGCTT    360                                                                 - - TTTAATCATT CCTTTGCGTA TGGCAAACCC TTTTTCGAGA TGGTTTCTAT CA -            #GAGAGACC    420                                                                 - - TCCAAACTGC CATGTAGCCG ATTTAGAAAC AAGTTTAGAT CCCCATCAAA CG -            #TTGTTGAA    480                                                                 - - GGTGCAAAAA TACAAACCCG CTTTAAGCGA CTGGGTGCAT TACATCTTCT TG -            #GGATCCAT    540                                                                 - - CATGCTGTTT GTGTTCATTA CTAATCCCGC ACCTTGGATC TTCAAGATCC TT -            #TTTTATTG    600                                                                 - - TTTCTTGGGC ACTTTATTCA TCATTCCAGC TACGTCACAG TTTTTCTTCA AT -            #GCCTTGCC    660                                                                 - - CATCCTAACA TGGGTGGCGC TGTATTTCAC TTCATCGTAC TTTCCAGATG AC -            #CGCAGGCC    720                                                                 - - TCCTATTACT GTCAAAGTGT TACCAGCGGT GGAAACAATT TTATACGGCG AC -            #AATTTAAG    780                                                                 - - TGATATTCTT GCAACATCGA CGAATTCCTT TTTGGACATT TTAGCATGGT TA -            #CCGTACGG    840                                                                 - - ACTATTTCAT TTTGGGGCCC CATTTGTCGT TGCTGCCATC TTATTCGTAT TT -            #GGTCCACC    900                                                                 - - AACTGTTTTG CAAGGTTATG CTTTTGCATT TGGTTATATG AACCTGTTTG GT -            #GTTATCAT    960                                                                 - - GCAAAATGTC TTTCCAGCCG CTCCCCCATG GTATAAAATT CTCTATGGAT TG -            #CAATCAGC   1020                                                                 - - CAACTATGAT ATGCATGGCT CGCCTGGTGG ATTAGCTAGA ATTGATAAGC TA -            #CTCGGTAT   1080                                                                 - - TAATATGTAT ACTACAGCTT TTTCAAATTC CTCCGTCATT TTCGGTGCTT TT -            #CCTTCACT   1140                                                                 - - GCATTCCGGG TGTGCTACTA TGGAAGCCCT GTTTTTCTGT TATTGTTTTC CA -            #AAATTGAA   1200                                                                 - - GCCCTTGTTT ATTGCTTATG TTTGCTGGTT ATGGTGGTCA ACTATGTATC TG -            #ACACACCA   1260                                                                 - - TTATTTTGTA GACCTTATGG CAGGTTCTGT GCTGTCATAC GTTATTTTCC AG -            #TACACAAA   1320                                                                 - - GTACACACAT TTACCAATTG TAGATACATC TCTTTTTTGC AGATGGTCAT AC -            #ACTTCAAT   1380                                                                 - - TGAGAAATAC GATATATCAA AGAGTGATCC ATTGGCTGCA GATTCAAACG AT -            #ATCGAAAG   1440                                                                 - - TGTCCCTTTG TCCAACTTGG AACTTGACTT TGATCTTAAT ATGACTGATG AA -            #CCCAGTGT   1500                                                                 - - AAGCCCTTCG TTATTTGATG GATCTACTTC TGTTTCTCGT TCGTCCGCCA CG -            #TCTATAAC   1560                                                                 - - GTCACTAGGT GTAAAGAGGG CTTAATGAGT ATTTTATCTG CAATTACGGA TA -            #CGGTTGGT   1620                                                                 - - CTTATGTAGA TACATATAAA TATATATCTT TTTCTTTCTT TTTCTTAGTC AG -            #GATTGTCG   1680                                                                 - - TTTAGCATAA TATACATGTA GTTTATTTAA TCACATACCA CTGATTATCT TT -            #AGAATTTT   1740                                                                 - - ATAAATTTTT GAAATAAATG GGTGGCTTTT AATGGTGTCT ATGTTAAGTG AG -            #GCTTTTAG   1800                                                                 - - AATGCTCTTC CTGCTTTGTT TATTATATGT GTATGAAAGA TATGTATGTA TT -            #TACATGTG   1860                                                                 - - TTTGTAGCGT CCCCAGTCAA AACCTGTGCG CTATACCTAA ATGGATTGAT AA -            #TCTTCATT   1920                                                                 - - CACTAATTCT AAAATAGACT TCTTCCCCAA AGAACGGTGT AACGATGAGG CT -            #CTATCCAG   1980                                                                 - - CTGCTTATCT AAATCAACTT TAACGATGGA TGATCTTATG ACACGGGGAT CT -            #TTCTTTAA   2040                                                                 - - AGTTCTTAGA ATTTCAGACT GTACCGCAGC TGATGAATCA AACAGCATTA AA -            #AAGTGATA   2100                                                                 - - TGCTCGAAAA TGTTTTTCCT GGTCTTTCTT CATTATTTTA GGAAGATACC TT -            #ATGCCCAT   2160                                                                 - - GGGTACAATG TCCCTCACCA CACCTCTGTT TTGAATAATC AGTTTCCCGA TT -            #GTGGAAGA   2220                                                                 - - CAATTCTTTT GCTTCCAACT TTGGCGCATT GGAGTTGGTT ATGCGAACAA GT -            #CCGATCAG   2280                                                                 - - CTCATAAAGC ATCTTAGTGA AAAGGGTGGT TTTGCGTTAT TCTTTCCTCT GT -            #TGAAGCTT   2340                                                                 - -  - - (2) INFORMATION FOR SEQ ID NO:8:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  401                                                              (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                               - - Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Se - #r Glu Arg Pro Pro           1               5 - #                 10 - #                 15              - - Asn Cys His Val Ala Asp Leu Glu Thr Ser Le - #u Asp Pro His Gln                           20 - #                 25 - #                 30              - - Thr Leu Leu Lys Val Gln Lys Tyr Lys Pro Al - #a Leu Ser Asp Trp                           35 - #                 40 - #                 45              - - Val His Tyr Ile Phe Leu Gly Ser Ile Met Le - #u Phe Val Phe Ile                           50 - #                 55 - #                 60              - - Thr Asn Pro Ala Pro Trp Ile Phe Lys Ile Le - #u Phe Tyr Cys Phe                           65 - #                 70 - #                 75              - - Leu Gly Thr Leu Phe Ile Ile Pro Ala Thr Se - #r Gln Phe Phe Phe                           80 - #                 85 - #                 90              - - Asn Ala Leu Pro Ile Leu Thr Trp Val Ala Le - #u Tyr Phe Thr Ser                           95 - #                100 - #                105              - - Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro Il - #e Thr Val Lys Val                          110  - #               115  - #               120              - - Leu Pro Ala Val Glu Thr Ile Leu Tyr Gly As - #p Asn Leu Ser Asp                          125  - #               130  - #               135              - - Ile Leu Ala Thr Ser Thr Asn Ser Phe Leu As - #p Ile Leu Ala Trp                          140  - #               145  - #               150              - - Leu Pro Tyr Gly Leu Phe His Phe Gly Ala Pr - #o Phe Val Val Ala                          155  - #               160  - #               165              - - Ala Ile Leu Phe Val Phe Gly Pro Pro Thr Va - #l Leu Gln Gly Tyr                          170  - #               175  - #               180              - - Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gl - #y Val Ile Met Gln                          185  - #               190  - #               195              - - Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Ly - #s Ile Leu Tyr Gly                          200  - #               205  - #               210              - - Leu Gln Ser Ala Asn Tyr Asp Met His Gly Se - #r Pro Gly Gly Leu                          215  - #               220  - #               225              - - Ala Arg Ile Asp Lys Leu Leu Gly Ile Asn Me - #t Tyr Thr Thr Ala                          230  - #               235  - #               240              - - Phe Ser Asn Ser Ser Val Ile Phe Gly Ala Ph - #e Pro Ser Leu His                          245  - #               250  - #               255              - - Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Ph - #e Cys Tyr Cys Phe                          260  - #               265  - #               270              - - Pro Lys Leu Lys Pro Leu Phe Ile Ala Tyr Va - #l Cys Trp Leu Trp                          275  - #               280  - #               285              - - Trp Ser Thr Met Tyr Leu Thr His His Tyr Ph - #e Val Asp Leu Met                          290  - #               295  - #               300              - - Ala Gly Ser Val Leu Ser Tyr Val Ile Phe Gl - #n Tyr Thr Lys Tyr                          305  - #               310  - #               315              - - Thr His Leu Pro Ile Val Asp Thr Ser Leu Ph - #e Cys Arg Trp Ser                          320  - #               325  - #               330              - - Tyr Thr Ser Ile Glu Lys Tyr Asp Ile Ser Ly - #s Ser Asp Pro Leu                          335  - #               340  - #               345              - - Ala Ala Asp Ser Asn Asp Ile Glu Ser Val Pr - #o Leu Ser Asn Leu                          350  - #               355  - #               360              - - Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Gl - #u Pro Ser Val Ser                          365  - #               370  - #               375              - - Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Se - #r Arg Ser Ser Ala                          380  - #               385  - #               390              - - Thr Ser Ile Thr Ser Leu Gly Val Lys Arg Al - #a                                          395  - #               400                                     - -  - - (2) INFORMATION FOR SEQ ID NO:9:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  5340                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: genomic DNA                                       - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                               - - AGCGCTTCTA TTTTCCTCCC CACCGCGAGG CGGAAATGGC ACATTTTTTT TC -             #TTTTGCTT     60                                                                 - - CTGTGCTTTT GCTGTAATTT TTGGCATGTG CTATTGTATG AAGATAACGC GT -            #GGTTCCGT    120                                                                 - - GGAAATAGCC GGAAATTTTG CCGGGAATAT GACGGACATG ATTTAACACC CG -            #TGGAAATG    180                                                                 - - AAAAAAGCCA AGGTAAGAAA GTGGCAATAT TTTTCCTACA AATAGATCTG CT -            #GTCCCTTA    240                                                                 - - GATGATTACC ATACATATAT ATATTTATTA CACACATCTG TCAGAGGTAG CT -            #AGCGAAGG    300                                                                 - - TGTCACTGAA ATATTTTTTG TTCCAGTTAG TATAAATACG GAGGTAGAAC AG -            #CTCTCCGC    360                                                                 - - GTGTATATCT TTTTTTGCGC TATACAAGAA CAGGAAGAAC GCATTTCCAT AC -            #CTTTTTCT    420                                                                 - - CCTTACAGGT GCCCTCTGAG TAGTGTCACG AACGAGGAAA AAGATTAATA TT -            #ACTGTTTT    480                                                                 - - TATATTCAAA AAGAGTAAAG CCGTTGCTAT ATACGAATAT GACGATTACC GT -            #GGGGGATG    540                                                                 - - CAGTTTCGGA GACGGAGCTG GAAAACAAAA GTCAAAACGT GGTACTATCT CC -            #CAAGGCAT    600                                                                 - - CTGCTTCTTC AGACATAAGC ACAGATGTTG ATAAAGACAC ATCGTCTTCT TG -            #GGATGACA    660                                                                 - - AATCTTTGCT GCCTACAGGT GAATATATTG TGGACAGAAA TAAGCCCCAA AC -            #CTACTTGA    720                                                                 - - ATAGCGATGA TATCGAAAAA GTGACAGAAT CTGATATTTT CCCTCAGAAA CG -            #TCTGTTTT    780                                                                 - - CATTCTTGCA CTCTAAGAAA ATTCCAGAAG TACCACAAAC CGATGACGAG AG -            #GAAGATAT    840                                                                 - - ATCCTCTGTT CCATACAAAT ATTATCTCTA ACATGTTTTT TTGGTGGGTT CT -            #ACCCATCC    900                                                                 - - TGCGAGTTGG TTATAAGAGA ACGATACAGC CGAACGATCT CTTCAAAATG GA -            #TCCGAGGA    960                                                                 - - TGTCTATAGA GACCCTTTAT GACGACTTTG AAAAAAACAT GATTTACTAT TT -            #TGAGAAGA   1020                                                                 - - CGAGGAAAAA ATACCGTAAA AGACATCCAG AAGCGACAGA AGAAGAGGTT AT -            #GGAAAATG   1080                                                                 - - CCAAACTACC TAAACATACA GTTCTGAGAG CTTTATTATT CACTTTTAAG AA -            #ACAGTACT   1140                                                                 - - TCATGTCGAT AGTGTTTGCA ATTCTCGCTA ATTGTACATC CGGTTTTAAC CC -            #CATGATTA   1200                                                                 - - CCAAGAGGCT AATTGAGTTT GTCGAAGAAA AGGCTATTTT TCATAGCATG CA -            #TGTTAACA   1260                                                                 - - AAGGTATTGG TTACGCTATT GGTGCATGTT TGATGATGTT CGTTAACGGG TT -            #GACGTTCA   1320                                                                 - - ATCATTTCTT TCATACATCC CAACTGACTG GTGTGCAAGC TAAGTCTATT CT -            #TACTAAAG   1380                                                                 - - CTGCCATGAA GAAAATGTTT AATGCATCTA ATTATGCGAG ACATTGTTTT CC -            #TAACGGTA   1440                                                                 - - AAGTGACTTC TTTTGTAACA ACAGATCTCG CTAGAATTGA ATTTGCCTTA TC -            #TTTTCAGC   1500                                                                 - - CGTTTTTGGC TGGGTTCCCT GCAATTTTGG CTATTTGCAT TGTTTTATTG AT -            #CGTTAACC   1560                                                                 - - TTGGACCCAT TGCCTTAGTT GGGATTGGTA TTTTTTTCGG TGGGTTTTTC AT -            #ATCCTTAT   1620                                                                 - - TTGCATTTAA GTTAATTCTG GGCTTTAGAA TTGCTGCGAA CATCTTCACT GA -            #TGCTAGAG   1680                                                                 - - TTACCATGAT GAGAGAAGTG CTGAATAATA TAAAAATGAT TAAATATTAT AC -            #GTGGGAGG   1740                                                                 - - ATGCGTATGA AAAAAATATT CAAGATATTA GGACCAAAGA GATTTCTAAA GT -            #TAGAAAAA   1800                                                                 - - TGCAACTATC AAGAAATTTC TTGATTGCTA TGGCCATGTC TTTGCCTAGT AT -            #TGCTTCAT   1860                                                                 - - TGGTCACTTT CCTTGCAATG TACAAAGTTA ATAAAGGAGG CAGGCAACCT GG -            #TAATATTT   1920                                                                 - - TTGCCTCTTT ATCTTTATTT CAGGTCTTGA GTTTGCAAAT GTTTTTCTTA CC -            #TATTGCTA   1980                                                                 - - TTGGTACTGG AATTGACATG ATCATTGGAT TGGGCCGTTT GCAAAGCTTA TT -            #GGAGGCTC   2040                                                                 - - CAGAAGATGA TCCAAATCAG ATGATTGAAA TGAAGCCCTC TCCTGGCTTT GA -            #TCCAAAAT   2100                                                                 - - TGGCTCTAAA AATGACACAT TGCTCATTTG AGTGGGAAGA TTATGAATTA AA -            #CGACGCTA   2160                                                                 - - TTGAAGAAGC AAAAGGAGAA GCTAAAGATG AAGGTAAAAA GAACAAAAAA AA -            #GCGTAAGG   2220                                                                 - - ATACATGGGG TAAGCCATCT GCAAGTACTA ATAAGGCGAA AAGATTGGAC AA -            #TATGTTGA   2280                                                                 - - AAGACAGAGA CGGCCCGGAA GATTTAGAAA AAACTTCGTT TAGGGGTTTC AA -            #GGACTTGA   2340                                                                 - - ACTTCGATAT TAAAAAGGGC GAATTTATTA TGATTACGGG ACCTATTGGT AC -            #TGGTAAAT   2400                                                                 - - CTTCATTATT GAATGCGATG GCAGGATCAA TGAGAAAAAT TGATGGTAAG GT -            #TGAAGTCA   2460                                                                 - - ACGGGGACTT ATTAATGTGT GGTTATCCAT GGATTCAAAA TGCATCTGTA AG -            #AGATAACA   2520                                                                 - - TCATATTCGG TTCACCATTC AATAAAGAAA AGTATGATGA AGTAGTTCGT GT -            #TTGCTCTT   2580                                                                 - - TGAAAGCTGA TCTGGATATT TTACCGGCAG GCGATATGAC CGAAATTGGG GA -            #ACGTGGTA   2640                                                                 - - TTACTTTATC TGGTGGTCAA AAGGCACGTA TCAATTTAGC CAGGTCTGTT TA -            #TAAGAAGA   2700                                                                 - - AGGATATTTA TGTATTCGAC GATGTCCTAA GTGCTGTCGA TTCTCGTGTT GG -            #TAAACACA   2760                                                                 - - TCATGGATGA ATGTCTAACC GGAATGCTTG CTAATAAAAC CAGAATTTTA GC -            #AACGCATC   2820                                                                 - - AGTTGTCACT GATTGAGAGA GCTTCTAGAG TCATCGTTTT AGGTACTGAT GG -            #CCAAGTCG   2880                                                                 - - ATATTGGTAC TGTTGATGAG CTAAAAGCTC GTAATCAAAC TTTGATAAAT CT -            #TTTACAAT   2940                                                                 - - TCTCTTCTCA AAATTCGGAG AAAGAGGATG AAGAACAGGA AGCGGTTGTT TC -            #CGGTGAAT   3000                                                                 - - TGGGACAACT AAAATATGAA CCAGAGGTAA AGGAATTGAC TGAACTGAAG AA -            #AAAGGCTA   3060                                                                 - - CAGAAATGTC ACAAACTGCA AATAGTGGTA AAATTGTAGC GGATGGTCAT AC -            #TAGTAGTA   3120                                                                 - - AAGAAGAAAG AGCAGTCAAT AGTATCAGTC TGAAAATATA CCGTGAATAC AT -            #TAAAGCTG   3180                                                                 - - CAGTAGGTAA GTGGGGTTTT ATCGCACTAC CGTTGTATGC AATTTTAGTC GT -            #TGGAACCA   3240                                                                 - - CATTCTGCTC ACTTTTTTCT TCCGTTTGGT TATCTTACTG GACTGAGAAT AA -            #ATTCAAAA   3300                                                                 - - ACAGACCACC CAGTTTTTAT ATGGGTCTTT ACTCCTTCTT TGTGTTTGCT GC -            #TTTCATAT   3360                                                                 - - TCATGAATGG CCAGTTCACC ATACTTTGCG CAATGGGTAT TATGGCATCG AA -            #ATGGTTAA   3420                                                                 - - ATTTGAGGGC TGTGAAAAGA ATTTTACACA CTCCAATGTC ATACATAGAT AC -            #CACACCTT   3480                                                                 - - TGGGACGTAT TCTGAACAGA TTCACAAAAG ATACAGATAG CTTAGATAAT GA -            #GTTAACCG   3540                                                                 - - AAAGTTTACG GTTGATGACA TCTCAATTTG CTAATATTGT AGGTGTTTGC GT -            #CATGTGTA   3600                                                                 - - TTGTTTACTT GCCGTGGTTT GCTATCGCAA TTCCGTTTCT TTTGGTCATC TT -            #TGTTCTGA   3660                                                                 - - TTGCTGATCA TTATCAGAGT TCTGGTAGAG AAATTAAAAG ACTTGAAGCT GT -            #GCAACGGT   3720                                                                 - - CTTTTGTTTA CAATAATTTA AATGAAGTTT TGGGTGGGAT GGATACAATC AA -            #AGCATACC   3780                                                                 - - GAAGTCAGGA ACGATTTTTG GCGAAATCAG ATTTTTTGAT CAACAAGATG AA -            #TGAGGCGG   3840                                                                 - - GATACCTTGT AGTTGTCCTG CAAAGATGGG TAGGTATTTT CCTTGATATG GT -            #TGCTATCG   3900                                                                 - - CATTTGCACT AATTATTACG TTATTGTGTG TTACGAGAGC CTTTCCTATT TC -            #CGCGGCTT   3960                                                                 - - CAGTTGGTGT TTTGTTGACT TATGTATTAC AATTGCCTGG TCTATTAAAT AC -            #CATTTTAA   4020                                                                 - - GGGCAATGAC TCAAACAGAG AATGACATGA ATAGTGCCGA AAGATTGGTA AC -            #ATATGCAA   4080                                                                 - - CTGAACTACC ACTAGAGGCA TCCTATAGAA AGCCCGAAAT GACACCTCCA GA -            #GTCATGGC   4140                                                                 - - CCTCAATGGG CGAAATAATT TTTGAAAATG TTGATTTTGC CTATAGACCT GG -            #TTTACCTA   4200                                                                 - - TAGTTTTAAA AAATCTTAAC TTGAATATCA AGAGTGGGGA AAAAATTGGT AT -            #CTGTGGTC   4260                                                                 - - GTACAGGTGC TGGTAAGTCC ACTATTATGA GTGCCCTTTA CAGGTTGAAT GA -            #ATTGACCG   4320                                                                 - - CAGGTAAAAT TTTAATTGAC AATGTTGATA TAAGTCAGCT GGGACTTTTC GA -            #TTTAAGAA   4380                                                                 - - GAAAATTAGC CATCATTCCA CAAGATCCAG TATTATTTAG GGGTACGATT CG -            #CAAGAACT   4440                                                                 - - TAGATCCATT TAATGAGCGT ACAGATGACG AATTATGGGA TGCATTGGTG AG -            #AGGTGGTG   4500                                                                 - - CTATCGCCAA GGATGACTTG CCGGAAGTGA AATTGCAAAA ACCTGATGAA AA -            #TGGTACTC   4560                                                                 - - ATGGTAAAAT GCATAAGTTC CATTTAGATC AAGCAGTGGA AGAAGAGGGC TC -            #CAATTTCT   4620                                                                 - - CCTTAGGTGA GAGACAACTA TTAGCATTAA CAAGGGCATT GGTCCGCCAA TC -            #AAAAATAT   4680                                                                 - - TGATTTTGGA TGAGGCTACA TCCTCAGTGG ACTACGAAAC GGATGGCAAA AT -            #CCAAACAC   4740                                                                 - - GTATTGTTGA GGAATTTGGA GATTGTACAA TTTTGTGTAT TGCTCACAGA CT -            #GAAGACCA   4800                                                                 - - TTGTAAATTA TGATCGTATT CTTGTTTTAG AGAAGGGTGA AGTCGCAGAA TT -            #CGATACAC   4860                                                                 - - CATGGACGTT GTTTAGTCAA GAAGATAGTA TTTTCAGAAG CATGTGTTCT AG -            #ATCTGGTA   4920                                                                 - - TTGTGGAAAA TGATTTCGAG AACAGAAGTT AATTTATATT ATTTGTTGCA TG -            #ATTTTTCT   4980                                                                 - - CTTTTATTTA TTTATATGTT GCCGATGGTA CAAATTAGTA CTAGAAAAGA AA -            #ACCCACTA   5040                                                                 - - CTATGACTTG CAGAAAAAGT TATGTGTGCC ATAGATAGAT ATAATTGCAT AC -            #CCACATCG   5100                                                                 - - TATACTCAAA ATTCCGAAAA GAACATTTCA TTTTTTATGA GGCAAACTGA AC -            #AACGCTTC   5160                                                                 - - GGTCCTTTTT TCATTCTAGA AATATATATT TATACATCAT TTTCAGAAGA TA -            #TTCAAAGA   5220                                                                 - - ACTTATTGGG ATGTCTATTT ACTGAATAAA GTATACACAA AAAACGAATT TA -            #AAATGGAA   5280                                                                 - - GGCATAAATA GAAAACTTAG AAGTGAAAAT CCTAAAACCG AAGGATATTT CA -            #AATACGTA   5340                                                                 - -  - - (2) INFORMATION FOR SEQ ID NO:10:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  1477                                                             (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                              - - Met Thr Ile Thr Val Gly Asp Ala Val Ser Gl - #u Thr Glu Leu Glu                           - #5                  - #10                  - #15           - - Asn Lys Ser Gln Asn Val Val Leu Ser Pro Ly - #s Ala Ser Ala Ser                           20 - #                 25 - #                 30              - - Ser Asp Ile Ser Thr Asp Val Asp Lys Asp Th - #r Ser Ser Ser Trp                           35 - #                 40 - #                 45              - - Asp Asp Lys Ser Leu Leu Pro Thr Gly Glu Ty - #r Ile Val Asp Arg                           50 - #                 55 - #                 60              - - Asn Lys Pro Gln Thr Tyr Leu Asn Ser Asp As - #p Ile Glu Lys Val                           65 - #                 70 - #                 75              - - Thr Glu Ser Asp Ile Phe Pro Gln Lys Arg Le - #u Phe Ser Phe Leu                           80 - #                 85 - #                 90              - - His Ser Lys Lys Ile Pro Glu Val Pro Gln Th - #r Asp Asp Glu Arg                           95 - #                100 - #                105              - - Lys Ile Tyr Pro Leu Phe His Thr Asn Ile Il - #e Ser Asn Met Phe                          110  - #               115  - #               120              - - Phe Trp Trp Val Leu Pro Ile Leu Arg Val Gl - #y Tyr Lys Arg Thr                          125  - #               130  - #               135              - - Ile Gln Pro Asn Asp Leu Phe Lys Met Asp Pr - #o Arg Met Ser Ile                          140  - #               145  - #               150              - - Glu Thr Leu Tyr Asp Asp Phe Glu Lys Asn Me - #t Ile Tyr Tyr Phe                          155  - #               160  - #               165              - - Glu Lys Thr Arg Lys Lys Tyr Arg Lys Arg Hi - #s Pro Glu Ala Thr                          170  - #               175  - #               180              - - Glu Glu Glu Val Met Glu Asn Ala Lys Leu Pr - #o Lys His Thr Val                          185  - #               190  - #               195              - - Leu Arg Ala Leu Leu Phe Thr Phe Lys Lys Gl - #n Tyr Phe Met Ser                          200  - #               205  - #               210              - - Ile Val Phe Ala Ile Leu Ala Asn Cys Thr Se - #r Gly Phe Asn Pro                          215  - #               220  - #               225              - - Met Ile Thr Lys Arg Leu Ile Glu Phe Val Gl - #u Glu Lys Ala Ile                          230  - #               235  - #               240              - - Phe His Ser Met His Val Asn Lys Gly Ile Gl - #y Tyr Ala Ile Gly                          245  - #               250  - #               255              - - Ala Cys Leu Met Met Phe Val Asn Gly Leu Th - #r Phe Asn His Phe                          260  - #               265  - #               270              - - Phe His Thr Ser Gln Leu Thr Gly Val Gln Al - #a Lys Ser Ile Leu                          275  - #               280  - #               285              - - Thr Lys Ala Ala Met Lys Lys Met Phe Asn Al - #a Ser Asn Tyr Ala                          290  - #               295  - #               300              - - Arg His Cys Phe Pro Asn Gly Lys Val Thr Se - #r Phe Val Thr Thr                          305  - #               310  - #               315              - - Asp Leu Ala Arg Ile Glu Phe Ala Leu Ser Ph - #e Gln Pro Phe Leu                          320  - #               325  - #               330              - - Ala Gly Phe Pro Ala Ile Leu Ala Ile Cys Il - #e Val Leu Leu Ile                          335  - #               340  - #               345              - - Val Asn Leu Gly Pro Ile Ala Leu Val Gly Il - #e Gly Ile Phe Phe                          350  - #               355  - #               360              - - Gly Gly Phe Phe Ile Ser Leu Phe Ala Phe Ly - #s Leu Ile Leu Gly                          365  - #               370  - #               375              - - Phe Arg Ile Ala Ala Asn Ile Phe Thr Asp Al - #a Arg Val Thr Met                          380  - #               385  - #               390              - - Met Arg Glu Val Leu Asn Asn Ile Lys Met Il - #e Lys Tyr Tyr Thr                          395  - #               400  - #               405              - - Trp Glu Asp Ala Tyr Glu Lys Asn Ile Gln As - #p Ile Arg Thr Lys                          410  - #               415  - #               420              - - Glu Ile Ser Lys Val Arg Lys Met Gln Leu Se - #r Arg Asn Phe Leu                          425  - #               430  - #               435              - - Ile Ala Met Ala Met Ser Leu Pro Ser Ile Al - #a Ser Leu Val Thr                          440  - #               445  - #               450              - - Phe Leu Ala Met Tyr Lys Val Asn Lys Gly Gl - #y Arg Gln Pro Gly                          455  - #               460  - #               465              - - Asn Ile Phe Ala Ser Leu Ser Leu Phe Gln Va - #l Leu Ser Leu Gln                          470  - #               475  - #               480              - - Met Phe Phe Leu Pro Ile Ala Ile Gly Thr Gl - #y Ile Asp Met Ile                          485  - #               490  - #               495              - - Ile Gly Leu Gly Arg Leu Gln Ser Leu Leu Gl - #u Ala Pro Glu Asp                          500  - #               505  - #               510              - - Asp Pro Asn Gln Met Ile Glu Met Lys Pro Se - #r Pro Gly Phe Asp                          515  - #               520  - #               525              - - Pro Lys Leu Ala Leu Lys Met Thr His Cys Se - #r Phe Glu Trp Glu                          530  - #               535  - #               540              - - Asp Tyr Glu Leu Asn Asp Ala Ile Glu Glu Al - #a Lys Gly Glu Ala                          545  - #               550  - #               555              - - Lys Asp Glu Gly Lys Lys Asn Lys Lys Lys Ar - #g Lys Asp Thr Trp                          560  - #               565  - #               570              - - Gly Lys Pro Ser Ala Ser Thr Asn Lys Ala Ly - #s Arg Leu Asp Asn                          575  - #               580  - #               585              - - Met Leu Lys Asp Arg Asp Gly Pro Glu Asp Le - #u Glu Lys Thr Ser                          590  - #               595  - #               600              - - Phe Arg Gly Phe Lys Asp Leu Asn Phe Asp Il - #e Lys Lys Gly Glu                          605  - #               610  - #               615              - - Phe Ile Met Ile Thr Gly Pro Ile Gly Thr Gl - #y Lys Ser Ser Leu                          620  - #               625  - #               630              - - Leu Asn Ala Met Ala Gly Ser Met Arg Lys Il - #e Asp Gly Lys Val                          635  - #               640  - #               645              - - Glu Val Asn Gly Asp Leu Leu Met Cys Gly Ty - #r Pro Trp Ile Gln                          650  - #               655  - #               660              - - Asn Ala Ser Val Arg Asp Asn Ile Ile Phe Gl - #y Ser Pro Phe Asn                          665  - #               670  - #               675              - - Lys Glu Lys Tyr Asp Glu Val Val Arg Val Cy - #s Ser Leu Lys Ala                          680  - #               685  - #               690              - - Asp Leu Asp Ile Leu Pro Ala Gly Asp Met Th - #r Glu Ile Gly Glu                          695  - #               700  - #               705              - - Arg Gly Ile Thr Leu Ser Gly Gly Gln Lys Al - #a Arg Ile Asn Leu                          710  - #               715  - #               720              - - Ala Arg Ser Val Tyr Lys Lys Lys Asp Ile Ty - #r Val Phe Asp Asp                          725  - #               730  - #               735              - - Val Leu Ser Ala Val Asp Ser Arg Val Gly Ly - #s His Ile Met Asp                          740  - #               745  - #               750              - - Glu Cys Leu Thr Gly Met Leu Ala Asn Lys Th - #r Arg Ile Leu Ala                          755  - #               760  - #               765              - - Thr His Gln Leu Ser Leu Ile Glu Arg Ala Se - #r Arg Val Ile Val                          770  - #               775  - #               780              - - Leu Gly Thr Asp Gly Gln Val Asp Ile Gly Th - #r Val Asp Glu Leu                          785  - #               790  - #               795              - - Lys Ala Arg Asn Gln Thr Leu Ile Asn Leu Le - #u Gln Phe Ser Ser                          800  - #               805  - #               810              - - Gln Asn Ser Glu Lys Glu Asp Glu Glu Gln Gl - #u Ala Val Val Ser                          815  - #               820  - #               825              - - Gly Glu Leu Gly Gln Leu Lys Tyr Glu Pro Gl - #u Val Lys Glu Leu                          830  - #               835  - #               840              - - Thr Glu Leu Lys Lys Lys Ala Thr Glu Met Se - #r Gln Thr Ala Asn                          845  - #               850  - #               855              - - Ser Gly Lys Ile Val Ala Asp Gly His Thr Se - #r Ser Lys Glu Glu                          860  - #               865  - #               870              - - Arg Ala Val Asn Ser Ile Ser Leu Lys Ile Ty - #r Arg Glu Tyr Ile                          875  - #               880  - #               885              - - Lys Ala Ala Val Gly Lys Trp Gly Phe Ile Al - #a Leu Pro Leu Tyr                          890  - #               895  - #               900              - - Ala Ile Leu Val Val Gly Thr Thr Phe Cys Se - #r Leu Phe Ser Ser                          905  - #               910  - #               915              - - Val Trp Leu Ser Tyr Trp Thr Glu Asn Lys Ph - #e Lys Asn Arg Pro                          920  - #               925  - #               930              - - Pro Ser Phe Tyr Met Gly Leu Tyr Ser Phe Ph - #e Val Phe Ala Ala                          935  - #               940  - #               945              - - Phe Ile Phe Met Asn Gly Gln Phe Thr Ile Le - #u Cys Ala Met Gly                          950  - #               955  - #               960              - - Ile Met Ala Ser Lys Trp Leu Asn Leu Arg Al - #a Val Lys Arg Ile                          965  - #               970  - #               975              - - Leu His Thr Pro Met Ser Tyr Ile Asp Thr Th - #r Pro Leu Gly Arg                          980  - #               985  - #               990              - - Ile Leu Asn Arg Phe Thr Lys Asp Thr Asp Se - #r Leu Asp Asn Glu                          995  - #              1000   - #             1005              - - Leu Thr Glu Ser Leu Arg Leu Met Thr Ser Gl - #n Phe Ala Asn Ile                         1010  - #              1015   - #             1020              - - Val Gly Val Cys Val Met Cys Ile Val Tyr Le - #u Pro Trp Phe Ala                         1025  - #              1030   - #             1035              - - Ile Ala Ile Pro Phe Leu Leu Val Ile Phe Va - #l Leu Ile Ala Asp                         1040  - #              1045   - #             1050              - - His Tyr Gln Ser Ser Gly Arg Glu Ile Lys Ar - #g Leu Glu Ala Val                         1055  - #              1060   - #             1065              - - Gln Arg Ser Phe Val Tyr Asn Asn Leu Asn Gl - #u Val Leu Gly Gly                         1070  - #              1075   - #             1080              - - Met Asp Thr Ile Lys Ala Tyr Arg Ser Gln Gl - #u Arg Phe Leu Ala                         1085  - #              1090   - #             1095              - - Lys Ser Asp Phe Leu Ile Asn Lys Met Asn Gl - #u Ala Gly Tyr Leu                         1100  - #              1105   - #             1110              - - Val Val Val Leu Gln Arg Trp Val Gly Ile Ph - #e Leu Asp Met Val                         1115  - #              1120   - #             1125              - - Ala Ile Ala Phe Ala Leu Ile Ile Thr Leu Le - #u Cys Val Thr Arg                         1130  - #              1135   - #             1140              - - Ala Phe Pro Ile Ser Ala Ala Ser Val Gly Va - #l Leu Leu Thr Tyr                         1145  - #              1150   - #             1155              - - Val Leu Gln Leu Pro Gly Leu Leu Asn Thr Il - #e Leu Arg Ala Met                         1160  - #              1165   - #             1170              - - Thr Gln Thr Glu Asn Asp Met Asn Ser Ala Gl - #u Arg Leu Val Thr                         1175  - #              1180   - #             1185              - - Tyr Ala Thr Glu Leu Pro Leu Glu Ala Ser Ty - #r Arg Lys Pro Glu                         1190  - #              1195   - #             1200              - - Met Thr Pro Pro Glu Ser Trp Pro Ser Met Gl - #y Glu Ile Ile Phe                         1205  - #              1210   - #             1215              - - Glu Asn Val Asp Phe Ala Tyr Arg Pro Gly Le - #u Pro Ile Val Leu                         1220  - #              1225   - #             1230              - - Lys Asn Leu Asn Leu Asn Ile Lys Ser Gly Gl - #u Lys Ile Gly Ile                         1235  - #              1240   - #             1245              - - Cys Gly Arg Thr Gly Ala Gly Lys Ser Thr Il - #e Met Ser Ala Leu                         1250  - #              1255   - #             1260              - - Tyr Arg Leu Asn Glu Leu Thr Ala Gly Lys Il - #e Leu Ile Asp Asn                         1265  - #              1270   - #             1275              - - Val Asp Ile Ser Gln Leu Gly Leu Phe Asp Le - #u Arg Arg Lys Leu                         1280  - #              1285   - #             1290              - - Ala Ile Ile Pro Gln Asp Pro Val Leu Phe Ar - #g Gly Thr Ile Arg                         1295  - #              1300   - #             1305              - - Lys Asn Leu Asp Pro Phe Asn Glu Arg Thr As - #p Asp Glu Leu Trp                         1310  - #              1315   - #             1320              - - Asp Ala Leu Val Arg Gly Gly Ala Ile Ala Ly - #s Asp Asp Leu Pro                         1325  - #              1330   - #             1335              - - Glu Val Lys Leu Gln Lys Pro Asp Glu Asn Gl - #y Thr His Gly Lys                         1340  - #              1345   - #             1350              - - Met His Lys Phe His Leu Asp Gln Ala Val Gl - #u Glu Glu Gly Ser                         1355  - #              1360   - #             1365              - - Asn Phe Ser Leu Gly Glu Arg Gln Leu Leu Al - #a Leu Thr Arg Ala                         1370  - #              1375   - #             1380              - - Leu Val Arg Gln Ser Lys Ile Leu Ile Leu As - #p Glu Ala Thr Ser                         1385  - #              1390   - #             1395              - - Ser Val Asp Tyr Glu Thr Asp Gly Lys Ile Gl - #n Thr Arg Ile Val                         1400  - #              1405   - #             1410              - - Glu Glu Phe Gly Asp Cys Thr Ile Leu Cys Il - #e Ala His Arg Leu                         1415  - #              1420   - #             1425              - - Lys Thr Ile Val Asn Tyr Asp Arg Ile Leu Va - #l Leu Glu Lys Gly                         1430  - #              1435   - #             1440              - - Glu Val Ala Glu Phe Asp Thr Pro Trp Thr Le - #u Phe Ser Gln Glu                         1445  - #              1450   - #             1455              - - Asp Ser Ile Phe Arg Ser Met Cys Ser Arg Se - #r Gly Ile Val Glu                         1460  - #              1465   - #             1470              - - Asn Asp Phe Glu Asn Arg Ser                                                             1475                                                            - -  - - (2) INFORMATION FOR SEQ ID NO:11:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  26                                                               (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: Other nucleic acid - #(synthetic DNA)             - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                              - - TTTGGTTAYA TGAAYYTNTT YGGNGT          - #                  - #                  26                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:12:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  29                                                               (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: Other nucleic acid - #(synthetic DNA)             - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                              - - TCTACAAART ARTGGTGNGT NARRTACAT         - #                  - #                29                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:13:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  2274                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: genomic DNA                                       - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                              - - TTATATATAT TATTGATTTG TTCCTGTTGT TATTTAGTTT AGAATCAGAC GA -             #CTACACCA     60                                                                 - - GAACCACAAT TCAACCAACA CTTATATAGA ACCTGGCTTG GAAAAAAGTA AC -            #ATTTATCA    120                                                                 - - TTCCTATACT TTTTTAGCAA ACATAATCCG TGTTTTACAT ATATTATTCA CC -            #CAATATCA    180                                                                 - - TAACAAAAAC AAACTGAATA ATGGCGTCTT CTATTTTGCG TTCCAAAATA AT -            #ACAAAAAC    240                                                                 - - CGTACCAATT ATTCCACTAC TATTTTCTTC TGGAGAAGGC TCCTGGTTCT AC -            #AGTTAGTG    300                                                                 - - ATTTGAATTT TGATACAAAC ATACAAACGA GTTTACGTAA ATTAAAGCAT CA -            #TCATTGGA    360                                                                 - - CGGTGGGAGA AATATTCCAT TATGGGTTTT TGGTTTCCAT ACTTTTTTTC GT -            #GTTTGTGG    420                                                                 - - TTTTCCCAGC TTCATTTTTT ATAAAATTAC CAATAATCTT AGCATTTGCT AC -            #TTGTTTTT    480                                                                 - - TAATACCCTT AACATCACAA TTTTTTCTTC CTGCCTTGCC CGTTTTCACT TG -            #GTTGGCAT    540                                                                 - - TATATTTTAC GTGTGCTAAA ATACCTCAAG AATGGAAACC AGCTATCACA GT -            #TAAAGTTT    600                                                                 - - TACCAGCTAT GGAAACAATT TTGTACGGCG ATAATTTATC AAATGTTTTG GC -            #AACCATCA    660                                                                 - - CTACCGGAGT GTTAGATATA TTGGCATGGT TACCATATGG GATTATTCAT TT -            #CAGTTTCC    720                                                                 - - CATTTGTACT TGCTGCTATT ATATTTTTAT TTGGGCCACC GACGGCATTA AG -            #ATCATTTG    780                                                                 - - GATTTGCCTT TGGTTATATG AACTTGCTTG GAGTCTTGAT TCAAATGGCA TT -            #CCCAGCTG    840                                                                 - - CTCCTCCATG GTACAAAAAC TTGCACGGAT TAGAACCAGC TAATTATTCA AT -            #GCACGGGT    900                                                                 - - CTCCTGGTGG ACTTGGAAGG ATAGATAAAT TGTTAGGTGT TGATATGTAT AC -            #CACAGGGT    960                                                                 - - TTTCCAATTC ATCAATCATT TTTGGGGCAT TCCCATCGTT ACATTCAGGA TG -            #TTGTATCA   1020                                                                 - - TGGAAGTGTT ATTTTTGTGT TGGTTGTTTC CACGATTCAA GTTTGTGTGG GT -            #TACATACG   1080                                                                 - - CATCTTGGCT TTGGTGGAGC ACGATGTATT TGACCCATCA CTACTTTGTC GA -            #TTTGATTG   1140                                                                 - - GTGGAGCCAT GCTATCTTTG ACTGTTTTTG AGTTCACCAA ATATAAATAT TT -            #GCCAAAAA   1200                                                                 - - ACAAAGAAGG CCTTTTCTGT CGTTGGTCAT ACACTGAAAT TGAAAAAATC GA -            #TATCCAAG   1260                                                                 - - AGATTGACCC TTTATCATAC AATTATATCC CTGTCAACAG CAATGATAAT GA -            #AAGCAGAT   1320                                                                 - - TGTATACGAG AGTGTACCAA GAGTCTCAGG TTAGTCCCCC ACAGAGAGCT GA -            #AACACCTG   1380                                                                 - - AAGCATTTGA GATGTCAAAT TTTTCTAGGT CTAGACAAAG CTCAAAGACT CA -            #GGTTCCAT   1440                                                                 - - TGAGTAATCT TACTAACAAT GATCAAGTGT CTGGAATTAA CGAAGAGGAT GA -            #AGAAGAAG   1500                                                                 - - AAGGCGATGA AATTTCATCG AGTACTCCTT CGGTGTTTGA AGACGAACCA CA -            #GGGTAGCA   1560                                                                 - - CATATGCTGC ATCCTCAGCT ACATCAGTAG ATGATTTGGA TTCCAAAAGA AA -            #TTAGTAAA   1620                                                                 - - ATAACAGTTT CTATTAATTT CTTTATTTCC TCCTAATTAA TGATTTTATG CT -            #CAATACCT   1680                                                                 - - ACACTATCTG TTTTTAATTT CCTACTTTTT TTTTATTATT GTTGAGTTCA TT -            #TGCTGTTC   1740                                                                 - - ATTGAATATT TACAATTTTG CATTAATTAC CATCAATATA GAATGGGCAC AG -            #TTTTTTTA   1800                                                                 - - AGTTTTTTTG TTTTTGTGTT TGTCTTTCTT TTTTTACATT AATGTGTTTG GA -            #TTGTTTTA   1860                                                                 - - GGTTCCTTTA TCCCTTAGCC CCCTCAGAAT ACTATTTTAT CTAATTAATT TG -            #TTTTTATT   1920                                                                 - - TTCTGATATT TACCAATTGC TTTTTCTTTT GGATATTTAT AATAGCATCC CC -            #TAATAATT   1980                                                                 - - AATATACAAC TGTTTCATAT ATATACGTGT ATGTCCTGTA GTGGTGGAAA CT -            #GGAGTCAA   2040                                                                 - - CATTTGTATT AATGTGTACA AGAAAGCAGT GTTAATGCTA CTATTATAAT TT -            #TTGAGGTG   2100                                                                 - - CAAATCAAGA GGTTGGCAGC TTTCTTATGG CTATGACCGT GAATGAAGGC TT -            #GTAAACCA   2160                                                                 - - CGTAATAAAC AAAAGCCAAC AAGTTTTTTT AGAGCCTTTA ACAACATACG CA -            #ATGAGAGT   2220                                                                 - - GATTGCAATA CTACAAGATA TAGCCCAAAA AATTGAATGC ATTTCAACAA CA - #AC             2274                                                                       - -  - - (2) INFORMATION FOR SEQ ID NO:14:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  471                                                              (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                              - - Met Ala Ser Ser Ile Leu Arg Ser Lys Ile Il - #e Gln Lys Pro Tyr                            - #5                  - #10                  - #15           - - Gln Leu Phe His Tyr Tyr Phe Leu Ser Glu Ly - #s Ala Pro Gly Ser                           20 - #                 25 - #                 30              - - Thr Val Ser Asp Leu Asn Phe Asp Thr Asn Il - #e Gln Thr Ser Leu                           35 - #                 40 - #                 45              - - Arg Lys Leu Lys His His His Trp Thr Val Gl - #y Glu Ile Phe His                           50 - #                 55 - #                 60              - - Tyr Gly Phe Leu Val Ser Ile Leu Phe Phe Va - #l Phe Val Val Phe                           65 - #                 70 - #                 75              - - Pro Ala Ser Phe Phe Ile Lys Leu Pro Ile Il - #e Leu Ala Phe Ala                           80 - #                 85 - #                 90              - - Thr Cys Phe Leu Ile Pro Leu Thr Ser Gln Ph - #e Phe Leu Pro Ala                           95 - #                100 - #                105              - - Leu Pro Val Phe Thr Trp Leu Ala Leu Tyr Ph - #e Thr Cys Ala Lys                          110  - #               115  - #               120              - - Ile Pro Gln Glu Trp Lys Pro Ala Ile Thr Va - #l Lys Val Leu Pro                          125  - #               130  - #               135              - - Ala Met Glu Thr Ile Leu Tyr Gly Asp Asn Le - #u Ser Asn Val Leu                          140  - #               145  - #               150              - - Ala Thr Ile Thr Thr Gly Val Leu Asp Ile Le - #u Ala Trp Leu Pro                          155  - #               160  - #               165              - - Tyr Gly Ile Ile His Phe Ser Phe Pro Phe Va - #l Leu Ala Ala Ile                          170  - #               175  - #               180              - - Ile Phe Leu Phe Gly Pro Pro Thr Ala Leu Ar - #g Ser Phe Gly Phe                          185  - #               190  - #               195              - - Ala Phe Gly Tyr Met Asn Leu Leu Gly Val Le - #u Ile Gln Met Ala                          200  - #               205  - #               210              - - Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Le - #u His Gly Leu Glu                          215  - #               220  - #               225              - - Pro Ala Asn Tyr Ser Met His Gly Ser Pro Gl - #y Gly Leu Gly Arg                          230  - #               235  - #               240              - - Ile Asp Lys Leu Leu Gly Val Asp Met Tyr Th - #r Thr Gly Phe Ser                          245  - #               250  - #               255              - - Asn Ser Ser Ile Ile Phe Gly Ala Phe Pro Se - #r Leu His Ser Gly                          260  - #               265  - #               270              - - Cys Cys Ile Met Glu Val Leu Phe Leu Cys Tr - #p Leu Phe Pro Arg                          275  - #               280  - #               285              - - Phe Lys Phe Val Trp Val Thr Tyr Ala Ser Tr - #p Leu Trp Trp Ser                          290  - #               295  - #               300              - - Thr Met Tyr Leu Thr His His Tyr Phe Val As - #p Leu Ile Gly Gly                          305  - #               310  - #               315              - - Ala Met Leu Ser Leu Thr Val Phe Glu Phe Th - #r Lys Tyr Lys Tyr                          320  - #               325  - #               330              - - Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Ar - #g Trp Ser Tyr Thr                          335  - #               340  - #               345              - - Glu Ile Glu Lys Ile Asp Ile Gln Glu Ile As - #p Pro Leu Ser Tyr                          350  - #               355  - #               360              - - Asn Tyr Ile Pro Val Asn Ser Asn Asp Asn Gl - #u Ser Arg Leu Tyr                          365  - #               370  - #               375              - - Thr Arg Val Tyr Gln Glu Ser Gln Val Ser Pr - #o Pro Gln Arg Ala                          380  - #               385  - #               390              - - Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Ph - #e Ser Arg Ser Arg                          395  - #               400  - #               405              - - Gln Ser Ser Lys Thr Gln Val Pro Leu Ser As - #n Leu Thr Asn Asn                          410  - #               415  - #               420              - - Asp Gln Val Ser Gly Ile Asn Glu Glu Asp Gl - #u Glu Glu Glu Gly                          425  - #               430  - #               435              - - Asp Glu Ile Ser Ser Ser Thr Pro Ser Val Ph - #e Glu Asp Glu Pro                          440  - #               445  - #               450              - - Gln Gly Ser Thr Tyr Ala Ala Ser Ser Ala Th - #r Ser Val Asp Asp                          455  - #               460  - #               465              - - Leu Asp Ser Lys Arg Asn                                                                  470                                                            - -  - - (2) INFORMATION FOR SEQ ID NO:15:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  243                                                              (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: genomic DNA                                       - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                              - - TTTGAAAAAT TTGAATTTTA AAATTAATCC AATGGAAAAA ATTGGTATTT GT -             #GGAAGAAC     60                                                                 - - CGGTGCTGGT AAATCATCAA TTATGACAGC ATTATATCGA TTATCAGAAT TA -            #GAACTGGG    120                                                                 - - GAAAATTATT ATTGATGATA TTGATATTTC AACTTTGGGT TTAAAAGATC TT -            #CGATCAAA    180                                                                 - - ATTATCAATT ATTCCTCAAG ATCCAGTATT ATTCCGAGGT TCAATTCGGA AA -            #AACTTGGA    240                                                                 - - TCC                  - #                  - #                  - #                243                                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:16:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  80                                                               (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                              - - Leu Lys Asn Leu Asn Phe Lys Ile Asn Pro Me - #t Glu Lys Ile Gly                            - #5                  - #10                  - #15           - - Ile Cys Gly Arg Thr Gly Ala Gly Lys Ser Se - #r Ile Met Thr Ala                           20 - #                 25 - #                 30              - - Leu Tyr Arg Leu Ser Glu Leu Glu Leu Gly Ly - #s Ile Ile Ile Asp                           35 - #                 40 - #                 45              - - Asp Ile Asp Ile Ser Thr Leu Gly Leu Lys As - #p Leu Arg Ser Lys                           50 - #                 55 - #                 60              - - Leu Ser Ile Ile Pro Gln Asp Pro Val Leu Ph - #e Arg Gly Ser Ile                           65 - #                 70 - #                 75              - - Arg Lys Asn Leu Asp                                                                       80                                                            - -  - - (2) INFORMATION FOR SEQ ID NO:17:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  1601                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: genomic DNA                                       - -     (iv) ANTI-SENSE: Yes                                                  - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                              - - AGGAAGATGA CTTGCATCAA AGATGGAGGA AGTGGTACTG GCAGGACGAT CA -             #ATCAAATC     60                                                                 - - AGCAGCAGGA CTAGGTAACG GCTCAGGTGA TGATGAACCC ACGGACCATT CA -            #TGATCGGT    120                                                                 - - GTTAGCAAGT TCCATATTGT TAAGACCACT CATGAAGGCT ACTGCATTAG GG -            #TTTTGAGT    180                                                                 - - AAAAGAATCC CTTCCAAGTA AGTATGGGCT GCCGGTACGA GCCAAGGAGT TG -            #CTGGTTTT    240                                                                 - - TTCGGAAAGA CCATGACCGT GGATAACAAA CTCGTATTCC CAACGAAGGA TT -            #TTACCAGT    300                                                                 - - TTGCAACTGT GGGAGGCGTA GCTTTTGAGC AAAAACGAAG CATATAATAG CT -            #AAACACAT    360                                                                 - - ACCGCCGACC AAATCTACAA AGTAGTGGTG GGTAAGGTAC ATAGTACACC AG -            #CAAAGCCA    420                                                                 - - TAGAACATAT CCATAAAAGC AGAAGCGGTA TCGAGGAAAC ACATGCGAAA GG -            #AAAAGTGC    480                                                                 - - TTCCAGCATG GCCCATCCAG CGTGAAGAGA TGGAAAGGCA CCAAAAACAA CC -            #GGAGAGTT    540                                                                 - - AGAAAAACCA TCAGTGTAAA TGCTAGTGCC GAAGAGAGCA TCAATACGGG CC -            #AATCCACC    600                                                                 - - AGGAGAGCCA CGTACTGCAT ACGTGGCAGG TTCTAAACCA TACATATTTT CA -            #TACCAAGG    660                                                                 - - AGGAGAACAG GGGAAAGCCA TTTGGATAAG AACACCAAAT AAATTCATAT AA -            #CCAAAAGT    720                                                                 - - TCGAGCCCAA ACTGGAAGAG TTCCAGGAGG TGCAAAGATG AAAAGAATAA AT -            #GAAATGAT    780                                                                 - - AAAAGGAGCC GAATAATGCA TGACTCCATA TGGAACCCAG GCCAAAATAT CA -            #AGGATGCT    840                                                                 - - ATGCGTGGTT TTCGAGAGAA GACTAGAAAG ATTAGAGCCA TAAAGAATAT TT -            #TCAAGTGT    900                                                                 - - GGGTAAAACA CGAACCCATA TGGGTGGACG CCAGCGTTCT GGAATAAACC TA -            #CAAGAGTA    960                                                                 - - AAATAAAATT GCCCAGGTGA TGATAACAAT GGCAGGAAAA AAAATTTGGC GT -            #GTTAAAGG   1020                                                                 - - AACGGTCAAC GCAATGGCCA AAAGACAGGC AATGCCAAAT TTCCCCCAGA AT -            #CCAGGAGA   1080                                                                 - - TTCAATGACA ATACAAGCAA AAATCAAATT ACCTGCTAGA AACACATATT GC -            #AAATGTGT   1140                                                                 - - CCATGACCAT TTCGTATTGC GTAGCAAACG AAATGTAGGC ATAGGGTTTA AG -            #CTTGTTTC   1200                                                                 - - CAACTTGTAT TGGGATGCTC GGTTACACGC AGCAAGGCGC TTTTTTAAGG TC -            #GAAAGAGC   1260                                                                 - - AGACATTGCT TCAAAGAATT ATCAGAGTAA AAAAGGGAAG CGTACGAAAA AA -            #ATTTCGTA   1320                                                                 - - AGGAATTAAC CGGAAAACTA AAGGAAAAAA AAGGAATTTT TATGAAGGAA AG -            #AAAGTAGC   1380                                                                 - - TATTAAATGC AAGTGTCAAG CACTTAAAAG TAGCGATGTA AAATATTTAA AA -            #AAAGATGG   1440                                                                 - - ACCGATTAAC CAATGTTCAG CTCACAGTTG CCAGCAATCA GGGCTATTTT TT -            #TATTTTTT   1500                                                                 - - TTATAAAATT GCTAATTATA TATAATATAA TTAGTTTATT AACTTGCTTT TC -            #CTCAAAAA   1560                                                                 - - ACCAATTCGA GAAAGGAACT TTTGCAGAGG CAAAAAAGCT T    - #                      - # 1601                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:18:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  1601                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: mRNA                                              - -     (iv) ANTI-SENSE: Yes                                                  - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                              - - AGGAAGAUGA CUUGCAUCAA AGAUGGAGGA AGUGGUACUG GCAGGACGAU CA -             #AUCAAAUC     60                                                                 - - AGCAGCAGGA CUAGGUAACG GCUCAGGUGA UGAUGAACCC ACGGACCAUU CA -            #UGAUCGGU    120                                                                 - - GUUAGCAAGU UCCAUAUUGU UAAGACCACU CAUGAAGGCU ACUGCAUUAG GG -            #UUUUGAGU    180                                                                 - - AAAAGAAUCC CUUCCAAGUA AGUAUGGGCU GCCGGUACGA GCCAAGGAGU UG -            #CUGGUUUU    240                                                                 - - UUCGGAAAGA CCAUGACCGU GGAUAACAAA CUCGUAUUCC CAACGAAGGA UU -            #UUACCAGU    300                                                                 - - UUGCAACUGU GGGAGGCGUA GCUUUUGAGC AAAAACGAAG CAUAUAAUAG CU -            #AAACACAU    360                                                                 - - ACCGCCGACC AAAUCUACAA AGUAGUGGUG GGUAAGGUAC AUAGUACACC AG -            #CAAAGCCA    420                                                                 - - UAGAACAUAU CCAUAAAAGC AGAAGCGGUA UCGAGGAAAC ACAUGCGAAA GG -            #AAAAGUGC    480                                                                 - - UUCCAGCAUG GCCCAUCCAG CGUGAAGAGA UGGAAAGGCA CCAAAAACAA CC -            #GGAGAGUU    540                                                                 - - AGAAAAACCA UCAGUGUAAA UGCUAGUGCC GAAGAGAGCA UCAAUACGGG CC -            #AAUCCACC    600                                                                 - - AGGAGAGCCA CGUACUGCAU ACGUGGCAGG UUCUAAACCA UACAUAUUUU CA -            #UACCAAGG    660                                                                 - - AGGAGAACAG GGGAAAGCCA UUUGGAUAAG AACACCAAAU AAAUUCAUAU AA -            #CCAAAAGU    720                                                                 - - UCGAGCCCAA ACUGGAAGAG UUCCAGGAGG UGCAAAGAUG AAAAGAAUAA AU -            #GAAAUGAU    780                                                                 - - AAAAGGAGCC GAAUAAUGCA UGACUCCAUA UGGAACCCAG GCCAAAAUAU CA -            #AGGAUGCU    840                                                                 - - AUGCGUGGUU UUCGAGAGAA GACUAGAAAG AUUAGAGCCA UAAAGAAUAU UU -            #UCAAGUGU    900                                                                 - - GGGUAAAACA CGAACCCAUA UGGGUGGACG CCAGCGUUCU GGAAUAAACC UA -            #CAAGAGUA    960                                                                 - - AAAUAAAAUU GCCCAGGUGA UGAUAACAAU GGCAGGAAAA AAAAUUUGGC GU -            #GUUAAAGG   1020                                                                 - - AACGGUCAAC GCAAUGGCCA AAAGACAGGC AAUGCCAAAU UUCCCCCAGA AU -            #CCAGGAGA   1080                                                                 - - UUCAAUGACA AUACAAGCAA AAAUCAAAUU ACCUGCUAGA AACACAUAUU GC -            #AAAUGUGU   1140                                                                 - - CCAUGACCAU UUCGUAUUGC GUAGCAAACG AAAUGUAGGC AUAGGGUUUA AG -            #CUUGUUUC   1200                                                                 - - CAACUUGUAU UGGGAUGCUC GGUUACACGC AGCAAGGCGC UUUUUUAAGG UC -            #GAAAGAGC   1260                                                                 - - AGACAUUGCU UCAAAGAAUU AUCAGAGUAA AAAAGGGAAG CGUACGAAAA AA -            #AUUUCGUA   1320                                                                 - - AGGAAUUAAC CGGAAAACUA AAGGAAAAAA AAGGAAUUUU UAUGAAGGAA AG -            #AAAGUAGC   1380                                                                 - - UAUUAAAUGC AAGUGUCAAG CACUUAAAAG UAGCGAUGUA AAAUAUUUAA AA -            #AAAGAUGG   1440                                                                 - - ACCGAUUAAC CAAUGUUCAG CUCACAGUUG CCAGCAAUCA GGGCUAUUUU UU -            #UAUUUUUU   1500                                                                 - - UUAUAAAAUU GCUAAUUAUA UAUAAUAUAA UUAGUUUAUU AACUUGCUUU UC -            #CUCAAAAA   1560                                                                 - - ACCAAUUCGA GAAAGGAACU UUUGCAGAGG CAAAAAAGCU U    - #                      - # 1601                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:19:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  12                                                               (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                              - - Cys Phe Thr Ser Ser Tyr Phe Pro Asp Asp Ar - #g Arg                                        - #     5             - #     10                             - -  - - (2) INFORMATION FOR SEQ ID NO:20:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  19                                                               (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                              - - Cys Tyr Thr Ser Ile Glu Lys Tyr Asp Ile Se - #r Lys Ser Asp Pro                            - #5                  - #10                  - #15           - - Leu Ala Ala Asp                                                           - -  - - (2) INFORMATION FOR SEQ ID NO:21:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  1553                                                             (B) TYPE:  nucleic a - #cid                                                   (C) STRANDEDNESS:  doub - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: Genomic DNA                                       - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                              - - TTTTACATAT ATTATTCACC CAATATCATA ACAAAAACAA ACTGAATGAT GG -             #CATCTTCT     60                                                                 - - ATTTTGCGTT CCAAAATAAT ACAAAAACCG TACCAATTAT TCCACTACTA TT -            #TTCTTCTG    120                                                                 - - GAGAAGGCTC CTGGTTCTAC AGTTAGTGAT TTGAATTTTG ATACAAACAT AC -            #AAACGAGT    180                                                                 - - TTACGTAAAT TAAAGCATCA TCATTGGACG GTGGGAGAAA TATTCCATTA TG -            #GGTTTTTG    240                                                                 - - GTTTCCATAC TTTTTTTCGT GTTTGTGGTT TTCCCAGCTT CATTTTTTAT AA -            #AATTACCA    300                                                                 - - ATAATCTTAG CATTTGCTAC TTGTTTTTTA ATACCCTTAA CATCACAATT TT -            #TTCTTCCT    360                                                                 - - GCCTTGCCCG TTTTCACTTG GTTGGCATTA TATTTTACGT GTGCTAAAAT AC -            #CTCAAGAA    420                                                                 - - TGGAAACCAG CTATCACAGT TAAAGTTTTA CCAGCTATGG AAACAATTTT GT -            #ACGGCGAT    480                                                                 - - AATTTATCAA ATGTTTTGGC AACCATCACT ACCGGAGTGT TAGATATATT GG -            #CATGGTTA    540                                                                 - - CCATATGGGA TTATTCATTT CAGTTTCCCA TTTGTACTTG CTGCTATTAT AT -            #TTTTATTT    600                                                                 - - GGGCCACCGA CGGCATTAAG ATCATTTGGA TTTGCCTTTG GTTATATGAA CT -            #TGCTTGGA    660                                                                 - - GTCTTGATTC AAATGGCATT CCCAGCTGCT CCTCCATGGT ACAAAAACTT GC -            #ACGGATTA    720                                                                 - - GAACCAGCTA ATTATTCAAT GCACGGGTCT CCTGGTGGAC TTGGAAGGAT AG -            #ATAAATTG    780                                                                 - - TTAGGTGTTG ATATGTATAC CACAGGGTTT TCCAATTCAT CAATCATTTT TG -            #GGGCATTC    840                                                                 - - CCATCGTTAC ATTCAGGATG TTGTATCATG GAAGTGTTAT TTTTGTGTTG GT -            #TGTTTCCA    900                                                                 - - CGATTCAAGT TTGTGTGGGT TACATACGCA TCTTGGCTTT GGTGGAGCAC GA -            #TGTATTTG    960                                                                 - - ACCCATCACT ACTTTGTCGA TTTGATTGGT GGAGCCATGC TATCTTTGAC TG -            #TTTTTGAA   1020                                                                 - - TTCACCAAAT ATAAATATTT GCCAAAAAAC AAAGAAGGCC TTTTCTGTCG TT -            #GGTCATAC   1080                                                                 - - ACTGAAATTG AAAAAATCGA TATCCAAGAG ATTGACCCTT TATCATACAA TT -            #ATATCCCT   1140                                                                 - - GTCAACAGCA ATGATAATGA AAGCAGATTG TATACGAGAG TGTACCAAGA GC -            #CTCAGGTT   1200                                                                 - - AGTCCCCCAC AGAGAGCTGA AACACCTGAA GCATTTGAGA TGTCAAATTT TT -            #CTAGGTCT   1260                                                                 - - AGACAAAGCT CAAAGACTCA GGTTCCATTG AGTAATCTTA CTAACAATGA TC -            #AAGTGCCT   1320                                                                 - - GGAATTAACG AAGAGGATGA AGAAGAAGAA GGCGATGAAA TTTCGTCGAG TA -            #CTCCTTCG   1380                                                                 - - GTGTTTGAAG ACGAACCACA GGGTAGCACA TATGCTGCAT CCTCAGCTAC AT -            #CAGTAGAT   1440                                                                 - - GATTTGGATT CCAAAAGAAA TTAGTAAAAC AGCAGTTTCT ATTAATTTCT TT -            #ATTTCCTC   1500                                                                 - - CTAATTAATG ATTTTATGTT CAATACCTAC ACTATCTGTT TTTAATTTCC TA - #C              1553                                                                       - -  - - (2) INFORMATION FOR SEQ ID NO:22:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH:  472                                                              (B) TYPE:  amino aci - #d                                                     (C) STRANDEDNESS:  sing - #le                                                 (D) TOPOLOGY:  linear                                                - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                              - - Met Met Ala Ser Ser Ile Leu Arg Ser Lys Il - #e Ile Gln Lys Pro            1               5 - #                 10 - #                 15              - - Tyr Gln Leu Phe His Tyr Tyr Phe Leu Leu Gl - #u Lys Ala Pro Gly                           20 - #                 25 - #                 30              - - Ser Thr Val Ser Asp Leu Asn Phe Asp Thr As - #n Ile Gln Thr Ser                           35 - #                 40 - #                 45              - - Leu Arg Lys Leu Lys His His His Trp Thr Va - #l Gly Glu Ile Phe                           50 - #                 55 - #                 60              - - His Tyr Gly Phe Leu Val Ser Ile Leu Phe Ph - #e Val Phe Val Val                           65 - #                 70 - #                 75              - - Phe Pro Ala Ser Phe Phe Ile Lys Leu Pro Il - #e Ile Leu Ala Phe                           80 - #                 85 - #                 90              - - Ala Thr Cys Phe Leu Ile Pro Leu Thr Ser Gl - #n Phe Phe Leu Pro                           95 - #                100 - #                105              - - Ala Leu Pro Val Phe Thr Trp Leu Ala Leu Ty - #r Phe Thr Cys Ala                          110  - #               115  - #               120              - - Lys Ile Pro Gln Glu Trp Lys Pro Ala Ile Th - #r Val Lys Val Leu                          125  - #               130  - #               135              - - Pro Ala Met Glu Thr Ile Leu Tyr Gly Asp As - #n Leu Ser Asn Val                          140  - #               145  - #               150              - - Leu Ala Thr Ile Thr Thr Gly Val Leu Asp Il - #e Leu Ala Trp Leu                          155  - #               160  - #               165              - - Pro Tyr Gly Ile Ile His Phe Ser Phe Pro Ph - #e Val Leu Ala Ala                          170  - #               175  - #               180              - - Ile Ile Phe Leu Phe Gly Pro Pro Thr Ala Le - #u Arg Ser Phe Gly                          185  - #               190  - #               195              - - Phe Ala Phe Gly Tyr Met Asn Leu Leu Gly Va - #l Leu Ile Gln Met                          200  - #               205  - #               210              - - Ala Phe Pro Ala Ala Pro Pro Trp Tyr Lys As - #n Leu His Gly Leu                          215  - #               220  - #               225              - - Glu Pro Ala Asn Tyr Ser Met His Gly Ser Pr - #o Gly Gly Leu Gly                          230  - #               235  - #               240              - - Arg Ile Asp Lys Leu Leu Gly Val Asp Met Ty - #r Thr Thr Gly Phe                          245  - #               250  - #               255              - - Ser Asn Ser Ser Ile Ile Phe Gly Ala Phe Pr - #o Ser Leu His Ser                          260  - #               265  - #               270              - - Gly Cys Cys Ile Met Glu Val Leu Phe Leu Cy - #s Trp Leu Phe Pro                          275  - #               280  - #               285              - - Arg Phe Lys Phe Val Trp Val Thr Tyr Ala Se - #r Trp Leu Trp Trp                          290  - #               295  - #               300              - - Ser Thr Met Tyr Leu Thr His His Tyr Phe Va - #l Asp Leu Ile Gly                          305  - #               310  - #               315              - - Gly Ala Met Leu Ser Leu Thr Val Phe Glu Ph - #e Thr Lys Tyr Lys                          320  - #               325  - #               330              - - Tyr Leu Pro Lys Asn Lys Glu Gly Leu Phe Cy - #s Arg Trp Ser Tyr                          335  - #               340  - #               345              - - Thr Glu Ile Glu Lys Ile Asp Ile Gln Glu Il - #e Asp Pro Leu Ser                          350  - #               355  - #               360              - - Tyr Asn Tyr Ile Pro Val Asn Ser Asn Asp As - #n Glu Ser Arg Leu                          365  - #               370  - #               375              - - Tyr Thr Arg Val Tyr Gln Glu Pro Gln Val Se - #r Pro Pro Gln Arg                          380  - #               385  - #               390              - - Ala Glu Thr Pro Glu Ala Phe Glu Met Ser As - #n Phe Ser Arg Ser                          395  - #               400  - #               405              - - Arg Gln Ser Ser Lys Thr Gln Val Pro Leu Se - #r Asn Leu Thr Asn                          410  - #               415  - #               420              - - Asn Asp Gln Val Pro Gly Ile Asn Glu Glu As - #p Glu Glu Glu Glu                          425  - #               430  - #               435              - - Gly Asp Glu Ile Ser Ser Ser Thr Pro Ser Va - #l Phe Glu Asp Glu                          440  - #               445  - #               450              - - Pro Gln Gly Ser Thr Tyr Ala Ala Ser Ser Al - #a Thr Ser Val Asp                          455  - #               460  - #               465              - - Asp Leu Asp Ser Lys Arg Asn                                                              470                                                          __________________________________________________________________________

We claim:
 1. An isolated protein which regulates aureobasidinsensitivity comprising the amino acid sequence of SEQ ID No. 2, 4, 6, 8,10, 14, 16 or
 22. 2. An isolated protein consisting of an amino acidsequence which is encoded by the nucleotide sequence of SEQ ID No. 1, 3,5, 7, 9, 13, 15 or
 21. 3. The protein of claim 1, which is isolated froma fungus.
 4. The protein of claim 2, which is isolated from a fungus. 5.The protein of claim 1, wherein the protein is isolated from a cell typeselected from the group consisting of Candida albicans, Candida albicansvar. stellatoidea, Candida tropicalis, Candida kefyr, Candidaparapsilosis, Candida krusei, Candida guilliermondii, Candida glabrata,Cryptococcus neoformans, Cryptococcus terreus, Rhodotorula rubra,Aspergillus fumigatus, Aspergillus clavatus, Aspergillus nidulans,Aspergillus terreus, Penicillium commune, Trichophyton mentagrophytes,Epidermophyton floccosum, Fonsecaea pedrosoi, Exophiala werneckii,Cladosporium bantianum, Histoplasma capsulatum, Paracoccidioidesbrasiliensis, Geotrichum candidum, Blastomyces dermatitidis,Schizosaccharomyces pombe, Saccharomyces cerevisiae, mouse lymphoma EL-4and mouse lymphoma L5178Y.
 6. The protein of claim 2, wherein theprotein is isolated from a cell type selected from the group consistingof Candida albicans, Candida albicans var. stellatoidea, Candidatropicalis, Candida kefyr, Candida parapsilosis, Candida krusei, Candidaguilliermondii, Candida glabrata, Cryptococcus neoformans, Cryptococcusterreus, Rhodotorula rubra, Aspergillus fumigatus, Aspergillus clavatus,Aspergillus nidulans, Aspergillus terreus, Penicillium commune,Trichophyton mentagrophytes, Epidermophyton floccosum, Fonsecaeapedrosoi, Exophiala werneckii, Cladosporium bantianum, Histoplasmacapsulatum, Paracoccidioides brasiliensis, Geotrichum candidum,Blastomyces dermatitidis, Schizosaccharomyces pombe, Saccharomycescerevisiae, mouse lymphoma EL-4 and mouse lymphoma L5178Y.